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Preface 


This book is based on a decade of our collaborative work on new models, 
methods, and empirical applications of age-period-cohort (APC) analysis. The 
identification and statistical estimation of classical APC multiple classifica- 
tion/accounting models—often termed the APC "conundrum"—have been 
a challenging analytic problem in demography, epidemiology, sociology, and 
the other social sciences for about four decades. The last great synthesis of 
APC methodology for the social sciences and demography was based on the 
work of William M. Mason and Stephen E. Fienberg in the 1970s and 1980s 
and presented in their 1985 book Cohort Analysis in Social Research: Beyond the 
Identification Problem (New York: Springer-Verlag). 

The Mason-Fienberg synthesis so dominated these disciplines in the 1980s 
and 1990s that relatively few new contributions to APC methodology were 
published in these disciplines during these years. Some APC methodologi- 
cal work continued in epidemiology, however, and around the year 2000, 
new interest emerged in demography and the social sciences. One of us 
entered the doctoral program at Duke University in that year, and the other 
became aware of Wenjiang Fu's initial work in the early 2000s on the intrinsic 
estimator as a new approach to the identification and estimation of the APC 
accounting model. We then teamed up with Fu in a 2004 article on statistical 
properties and empirical applications of the intrinsic estimator. 

This initial work on the intrinsic estimator led us to think more generally 
about APC analysis. The classical accounting model was formulated for a 
research design typically consisting of an age-by-time period table of popu- 
lation rates or proportions with a single observation per cell. However, new 
research designs that permit new classes of statistical models had emerged 
and produced new datasets for APC analysis by the year 2000. One of these 
is the repeated cross-sectional sample survey design in which data are 
obtained from individual members of a representative sample of a popu- 
lation repeatedly in a sequence over a number of years. When we initially 
studied some published APC analyses of data from repeated sample sur- 
veys, we found that they applied the classical APC accounting model. But 
this model does not take full advantage of the statistical power of the numer- 
ous individual observations within a specific cohort and time period in a 
repeated survey design. 

To do so, we were driven toward a hierarchical APC (HAPC) specifica- 
tion in the form of cross-classified models in which the individual observa- 
tions in repeated cross-sectional surveys are nested within time periods and 
cohorts. These models can be specified in mixed (fixed and random) effects 
or purely fixed effects forms. However, the mixed effects forms of HAPC 
models have both statistical and substantive advantages. Importantly, HAPC 
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models avoid the underidentification problems of the classical APC account- 
ing model and can be specified as linear mixed models (LMMs) for continu- 
ous, relatively bell-shaped (Gaussian) outcome variables or as generalized 
linear mixed models (GLMMs) for discrete, nonnormally distributed (non- 
Gaussian) outcomes. These specifications permit us to take advantage of 
the many developments in the statistical theory and methodology of mixed 
models and associated computer software in the past three decades, devel- 
opments that were not available to APC analysts in the 1970s and the 1980s. 
Ourinitialarticles on the statistical methodology and empirical applications 
of HAPC models of the LMM and GLMM classes were published in 2006 
and 2008. Most recently, we extended the reach of the HAPC approach to 
many other areas of research using APC analysis, such as the joint applica- 
tion of the mixed effects models and heteroscedastic regression in a study of 
trends in self-reported health with Hui Zheng and the use of HAPC models 
for the aggregate population rates data design in the case of cancer incidence 
and mortality that we illustrate in the book. These extensions to different 
directions and datasets are opening up a new genre of APC analyses with 
great potential. 

On recognizing the nested nature of the individual-level observations in 
repeated cross-section survey designs and the HAPC modeling framework 
to which it led us, we turned our sights to a third research design from which 
a number of datasets began to emerge in the 1990s and 2000s: the accelerated 
longitudinal panel study design in which an initial wave of study partici- 
pants is repeatedly surveyed across a number of subsequent time periods. 
What makes this design "accelerated" is the presence of study participants 
from a number of cohorts in the initial and subsequent waves. This permits 
the analysis of age-by-cohort and other cross-level interactions within the 
HAPC-GLMM framework that we developed for the repeated cross-sectional 
study design and also avoids the classical identification conundrum. 

In sum, the approaches that we have developed synthesize APC models 
and methods for these three research designs—age-by-time period tables of 
population rates or proportions, repeated cross-section sample surveys, and 
accelerated longitudinal panel studies—within a single, consistent HAPC- 
GLMM statistical modeling framework. Many approaches to APC analysis, 
including pure fixed effects approaches such as that of the APC accounting 
model, are special cases of this general system. And, by recognizing this, 
analyses of datasets can be conducted by application of alternative specifica- 
tions within this frame with the resulting empirical estimates compared for 
consistency across models, a form of sensitivity analysis. We emphasize that 
we do not claim to have "solved" the APC analysis problem in any of the 
work we have done. On the other hand, approaches to APC analysis can be 
arrayed according to their statistical properties, with some models and meth- 
ods having better properties than others. By this criterion, the models we 
have developed and describe in this book are relatively good. We believe that 
their empirical application to many different substantive problems will lead to 
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many fascinating new findings about how various outcome variables develop 
along the age, period, and cohort dimensions. And additional developments 
in APC statistical models and methods will be forthcoming, including varia- 
tions in the HAPC-GLMM family of models, as the analytic problems posed 
by APC analysis continue to stimulate new approaches and as new models, 
methods, and computational algorithms are developed in statistics. 

The general objective of this book is to bring our work together in one 
place. We build on our prior articles and include new technical discussions 
of statistical issues and many new empirical applications. Additional details 
on many of the published articles and empirical analyses cited in the book as 
well as computer software and sample programs to estimate the models can 
be found on the web page http://www.unc.edu/~yangy819/apc/index.html. 

Finally, we thank our collaborators on issues of APC analysis, including 
those who contributed to prior publications, especially Wenjiang J. Fu, Sam 
Schulhofer-Wohl, and Hui Zheng, and those who have assisted with data 
analyses featured in this book that are part of ongoing research projects, 
including Ting Li, a mathematical demographer and specialist in the bio- 
demography of aging; and Steven Frenk, a medical sociologist with diverse 
interests. Both of them joined the Lineberger Comprehensive Cancer Center 
and Carolina Population Center at the University of North Carolina in 2011 
as postdoctoral fellows working with the lead author (Y Y) and have con- 
tributed with the highest levels of rigor and dedication to the synergy of the 
research team and various projects associated with the APC analysis, cancer, 
and aging. We thank Igor Akushevich, senior research scientist in the Center 
for Population Health and Aging of the Duke Population Research Institute, 
who provided assistance with cancer incidence and mortality data prepara- 
tion. We also thank the students who have taken courses on cohort analysis 
and demographic methods that we taught over the years, asked interesting 
questions that prompt us to do a better job at explicating various methods 
with examples and additional materials, and provided their new perspec- 
tives both conceptually and analytically on this old problem. It has truly 
been intellectually stimulating and a pleasure to work with them. 


Yang Yang 
University of North Carolina at Chapel Hill 


Kenneth C. Land 
Duke University 
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Introduction 


Demographers, epidemiologists, and social scientists often deal with tempo- 
rally ordered datasets, that is, population or sample survey data in the form 
of observations or measurements on individuals or groups/populations of 
individuals that are repeated or ordered along a time dimension. In this 
context, a long-standing analytic problem is the conceptualization, estima- 
tion, and interpretation of the differential contributions of three time-related 
changes to the phenomena of interest, namely, the effects of differences in the 
ages of the individuals at the time of observation on an outcome of interest, 
termed age (A) effects; the effects of differences in the time periods of observa- 
tion or measurement of the outcome, termed period (P) effects; and the effects 
of differences in the year of birth or some other shared life events for a set 
of individuals, termed cohort (C) effects. To address this problem, researchers 
need to compare age-specific data recorded at different points in time and 
from different cohorts. A systematic study of such data is termed age-period- 
cohort (APC) analysis. APC analysis has the unique ability to depict parsimo- 
niously the entire complex of social, historical, and environmental factors 
that simultaneously affect individuals and populations of individuals. It has 
thus been widely used to address questions of enduring importance to the 
studies of social change, etiology of diseases, aging, and population pro- 
cesses and dynamics. 

The distinct meanings of A, P, and C effects will be elaborated and become 
more concrete in specific contexts. As a first specification, consider the defini- 
tion of these terms in the context of aging and human development across 
the life course, health, and chronic disease epidemiology (Yang 2007, 2009, 
2010). In this context, the following applies: 


Age effects are variations associated with chronological age groups. 
They can arise from physiological changes, accumulation of social 
experience, social role or status changes, or a combination of these. 
Age effects therefore reflect biological and social processes of aging 
internal to individuals and represent developmental changes across 
the life course. This can clearly be seen in the considerable regulari- 
ties of age variations across time and space in many outcomes, such 
as fertility, schooling, employment, marriage and family structure, 
disease prevalence and incidence, and mortality. 


2 Age-Period-Cohort Analysis 


Period effects are variations over time periods or calendar years that 
influence all age groups simultaneously. Period effects subsume 
a complex set of historical events and environmental factors, such 
as world wars, economic expansions and contractions, famine and 
pandemics of infectious diseases, public health interventions, and 
technology breakthroughs. Shifts in social, cultural, economic, or 
physical environments may in turn induce similar changes in the 
lives of all individuals at a point in time. Thus, period effects are 
evident from a correspondence in timing of changes in events and 
social and epidemiologic conditions that influence these events. For 
example, the decrease in lung cancer mortality in the United States 
after 1990 followed reductions in tar and nicotine yield per cigarette 
and increases in smoking cessation in earlier years (Jemal, Chu, and 
Tarone 2001). In addition to these direct effects, there may also be 
changes in disease classification or diagnostic techniques that affect 
the incidence of, or mortality from, certain diseases. For example, the 
increase in the slope of the period trend of U.S. female breast cancer 
mortality in the 1980s coincided with the marked increase in breast 
cancer incidence due to expanded use of diagnosis via mammogra- 
phy (Tarone, Chu, and Gaudette 1997). 


Cohort effects are changes across groups of individuals who experience 
an initial event such as birth or marriage in the same year or years. 
Birth cohorts are the most commonly examined unit of analysis 
in APC analysis. A birth cohort moves through life together and 
encounters the same historical and social events at the same ages. 
Birth cohorts that experience different historical and social condi- 
tions at various stages of their life course therefore have diverse 
exposures to socioeconomic, behavioral, and environmental risk 
factors. Cohort effects are evident in many cancer sites, chronic dis- 
eases, and human mortality. An in-depth discussion of the concept 
of cohort effects is given in the next chapter. 


The challenges posed by APC analysis are well known. Whether observed 
time-related changes can be distilled out and separated into aging, time 
period, and cohort components is a question usually deemed conceptually 
important but empirically intractable. It has been termed the “conundrum” 
of APC analysis (Glenn 2005: 20) for two reasons. The first is data limita- 
tions. Using cross-sectional data at one point in time, for example, aging and 
cohort effects are intermingled and confounded. Using longitudinal panel 
data for a single cohort, on the other hand, aging and period effects are 
intermingled and confounded. The second reason is the use of conventional 
linear regression models that suffer from either specification errors or an 
identification problem and consequently are incapable of distinguishing A, 
P, and C effects. 
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The identification problem has been a topic of intense discussion and 
research since the 1970s. This led to a synthesis of APC methodology for the 
social sciences and demography based on the work of William M. Mason 
and Stephen E. Fienberg in the 1970s and 1980s (Fienberg and Mason 1979; 
Mason and Fienberg 1985). The Mason-Fienberg synthesis so dominated 
these disciplines in the 1980s and 1990s that relatively few new contributions 
to APC methodology were published in these decades. By comparison, APC 
methodology continued to be of interest in epidemiology, within which sev- 
eral new graphical and analytic methods were published during this period. 

Although a variety of approaches has been proposed to solve the APC 
conundrum, each has limitations. Yet another challenge is a criticism often 
lodged against general-purpose methods of APC analysis, namely, they 
provide no avenue for testing specific, substantive, and mechanism-based 
hypotheses and thus are mere accounting devices of algebraic convenience 
that may be misleading. This leads to the question: What should an analyst 
do to model APC data in empirical research to further an understanding of 
the social and biological mechanisms generating the data? Since the year 
2000, new interest in APC models and methods has emerged in the social sci- 
ences to address this question. This includes a series of studies by us as well 
as works by others exemplified in a special issue of the Sociological Methods & 
Research (36(3) February 2008). 

The major objective of this book is to present new APC models, methods, 
and empirical applications. Statistics has continued to develop as a disci- 
pline since the Mason-Fienberg synthesis of 1985. New statistical models 
and new computationally intensive estimation methods have been devel- 
oped (e.g., mixed [fixed and random] effects models, Markov chain Monte 
Carlo methods). For another, datasets with new research designs that invite 
or even require the analysis of separate age, period, and cohort components 
of change are available. Accordingly, we seek to show some ways in which 
these statistical models and methods and research designs can be applied to 
open new possibilities for APC analysis. We aim to articulate and compare 
new and extant models and methods that can be widely used by analysts. We 
also aim to provide some useful guidelines on how to conduct APC analysis. 
In doing so, this book intends to make two essential contributions to quan- 
titative studies of time-related change. First, through the introduction of the 
generalized linear mixed model (GLMM) framework, we show how innova- 
tive estimation methods and new model specifications resolve the ^model 
identification problem" that has hampered the development of APC analysis 
for the past decades. Second, we address the major criticism against the util- 
ity of APC analysis by explaining and demonstrating the use of new models 
within the GLMM framework to uncover the mechanisms underlying age 
patterns and temporal trends in phenomena of interest to researchers. We 
achieve these goals through both methodological expositions and empirical 
studies. For empirical illustrations, we draw examples on a wide variety of 
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disciplines, such as sociology, demography, and epidemiology but focus on 
aging, longevity, and health disparities. We do not, however, claim that the 
new models and methods presented here are “solutions” to the APC analysis 
problem in any absolute sense. As articulated in Chapter 4, the classical APC 
identification problem in tabular arrays of population rates or proportions 
is a member of a class of structural underidentification problems for which 
there can never be a "complete" resolution. 

The contents of the volume are as follows: Chapter 2 discusses the concep- 
tualization of cohort effects and theoretical rationale for the importance of 
cohort analysis. Chapter 3 introduces prototypical datasets to be analyzed 
in further detail in subsequent chapters that characterize the application 
of APC analysis in three common research designs. Chapter 4 lays out the 
formal algebra of the APC analysis conundrum, reviews some conventional 
approaches to this problem, and sketches a GLMM framework that we use to 
organize the new families of models and methods. 

Chapter 5 focuses on an innovation within the conventional linear regres- 
sion models: the Intrinsic Estimator (IE) as a new method of coefficient 
estimation. Chapter 6 introduces a three-step procedure for APC analysis 
through empirical studies of U.S. cancer incidence and mortality trends by 
sex and race. It also illustrates the utility of APC models in demographic 
projections and forecasts through an empirical APC analysis and construc- 
tion of the associated implied projections of cancer mortality in the period 
2010—2029. As part of the methodological exposition of the nature and utili- 
ties of the IE method, we include in this chapter algebraic details of its sta- 
tistical properties with proofs (Section 5.3; Appendices 5.1-5.3) and model 
validation through Monte Carlo simulation analysis (Section 5.5). We also 
include computational algorithms for obtaining the prediction intervals for 
forecasting (Appendix 6.1). Readers not adept with or interested in advanced 
statistical methods can skip these sections. 

Chapters 7 and 8 introduce the mixed effects models for APC analysis using 
the hierarchical APC (HAPC) models. We emphasize two breakthroughs of 
this type of models compared to the linear fixed effects models classically 
used in APC analysis: contextualization of individual lives within cohorts and 
periods, which avoids the model identification problem, and incorporation 
of additional covariates, which allows for mechanism-based hypothesis testing. 
We illustrate in Chapter 7 the application of these models in studies of ver- 
bal ability trends in the United States and changing sex and race disparities 
in obesity. In Chapter 8 we analyze the social inequalities of happiness in 
relation to macroeconomic conditions and cohort characteristics and cancer 
mortality rates in relation to known risk factors and diagnostic and treat- 
ment factors. We also discuss in Chapter 8 extensions to HAPC models such 
as the full Bayesian estimation for small sample size problems and conjunc- 
tion with the heteroscedastic regression for ascertainment of between-group 
and within-group variations. Readers who are not statistically sophisticated 
can skip these extensions in Sections 8.4 and 8.5. 
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Chapter 9 develops a similar GLMM approach to the analysis of prospective 
panel data using accelerated longitudinal cohort designs. Through empiri- 
cal examples in studies of social stratification of aging and health, we show 
how to model age trajectories and cohort variations using HAPC-growth 
curve models. Chapter 10 concludes the volume with recaps of new avenues 
for APC analysis presented in previous chapters and suggestions for future 
directions of methodological research and data collection. 

To facilitate the application of the methods described in the volume (in 
Chapters 5-9), we have developed a companion World Wide Web page on 
APC analysis (http://www.unc.edu/~yangy819/apc/index.html). This page 
provides links to PDF files of major methodological and substantive arti- 
cles on APC analysis we reference in the book. It also provides sample codes 
using existing general-purpose statistical software packages, including R, 
SAS, and Stata. These are connected to the empirical analyses reported in 
the book. 
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2 


Why Cohort Analysis? 


2.1 Introduction 


Although studies of time-related change have long existed in the history 
of science, those that consider cohort change as distinct from age and time 
period variations appear in scholarly literature only relatively recently. Why 
is cohort analysis useful? Examples of the utility of cohort analysis can be 
found in demographic studies of human mortality. Descriptive analyses 
of nineteenth century English death rates clearly indicated stronger regu- 
larities in birth cohort changes relative to period variations (Derrick 1927; 
Kermack, McKendrick, and McKinlay 1934). The relevance of this approach 
was then recognized in subsequent epidemiologic investigations, the earli- 
est of which is the well-known study of tuberculosis mortality conducted 
by Frost (1940) that emphasized the influence of early life conditions, rather 
than current conditions, on development of a disease that has long latency. 
The usefulness of cohort analysis demonstrated by these early studies and 
the convenience of using simple indicators that are widely available in many 
kinds of data facilitated the quick spread of cohort analysis in demography 
and epidemiology. Although age-period-cohort (APC) analysis took root in 
these two fields relatively independently of one another, their common inter- 
ests in health, mortality, and longevity and similarities in the development of 
analytic techniques unite them as one cottage industry. 


2.2 The Conceptualization of Cohort Effects 


The applicability of APC analysis relies on the substantive importance of 
cohort influences (Hobcraft, Menken, and Preston 1982). APC analysis is, 
in this sense, synonymous with cohort analysis (Smith 2008). Among vari- 
ous cohorts defined by different initial events (such as marriage and college 
entrance), birth cohorts are the most commonly examined unit of analysis. 
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We introduce here the conceptualization of birth cohort effects and the 
importance of identifying such effects in the contexts of different fields of 
studies that are concerned with time change. 

Norman Ryder, in a seminal article published in 1965, articulated the con- 
ceptualrelevance of birth cohort to the study of social historical change. First, 
a birth cohort moves through life together and encounters the same histori- 
cal and social events at the same ages. Cohort effects then reflect formative 
experiences resulting from the intersection of individual biographies and 
macrosocial influences. Second, the succession of birth cohorts with different 
life experiences, termed demographic metabolism by Ryder, constantly changes 
the composition of the population and transforms the society. Therefore, 
cohorts can be conceived as the essence of social change. Third, cohort mem- 
bership could be considered as a social structural category that has an ana- 
lytic utility similar to that of social class. They both have explanatory power 
because they are surrogate indices of common characteristics of individuals 
in each category. Comparisons of historical cohorts can thus be useful in 
addressing an extraordinary range of substantive issues in social research. 
For instance, a recent analysis of Census 2000 data by Hughes and O’Rand 
(2004) compared the baby boomers (born 1946-1964) with their predeces- 
sors born earlier in the twentieth century who were defined by events and 
experiences unique to their times: young Progressives (1906-1915), Jazz Age 
babies (1916-1925), Depression kids (1926-1935), and war babies (1936-1945). 
Differences in socioeconomic attainments, marriage, fertility, and family 
structure across these cohorts are substantial and reflect the post-World War 
II transformation of American society. 

Cohort effects may also arise from differentials in early life conditions. 
This conceptualization is particularly relevant in the examination of health 
and illness wherein long-term exposure to risk factors is the major cause 
of the disease. Chronic disease epidemiology has long noted the impor- 
tance of early life exposures in explaining the susceptibility to disease and 
mortality later in adulthood (Ben-Shlomo and Kuh 2002). The fetal origin 
hypothesis, for instance, argues that malnutrition in utero and during infancy 
adversely affects intrauterine growth and postnatal development and may 
increase the risks of cardiovascular and respiratory diseases, cancer, non- 
insulin-dependent diabetes, metabolic syndromes, and mortality (Barker 
1998). The cohort morbidity phenotype hypothesis further links reductions in 
early life exposures to inflammatory infection to cohort declines in cardio- 
vascular and overall mortalities (Finch and Crimmins 2004). The theory of 
technophysio evolution (Fogel and Costa 1997) also implies that throughout the 
twentieth century, individuals' health capital has changed with the year of 
birth. More recent cohorts fared substantially better in the initial endowment 
of health capital at birth and have lower depreciation rates in that stock of 
health capital. These led to improved physiological capacities in later cohorts 
that also bode well for effectiveness of medical treatments. This is consistent 


Why Cohort Analysis? 9 


with mounting evidence in the recent demographic literature that shows 
successive birth cohorts experience later onset of chronic diseases and dis- 
abilities (Crimmins, Reynolds, and Saito 1999; Freedman and Martin 1998). 

Studies of aging use cohort analysis to assess changes in individual out- 
comes in relation to aging within or across birth cohorts. Sociological theories 
of aging and the life course emphasize that the way individual lives unfold 
with age is largely shaped by social historical context (Elder 1974). Examining 
cohort membership as a contextual characteristic has important implications 
for a better understanding of the heterogeneous experiences of aging. Riley 
(1987) articulated the cohort-specific aging process and advanced the prin- 
ciple of cohort differences in aging. A birth cohort shares the same birth year and 
ages together. Because members in different birth cohorts age in unique ways 
shaped by the disparate sociohistorical and epidemiologic conditions, each 
cohort experiences a distinct life course. For example, recent studies on old- 
age depression found evidence of cohort heterogeneity in mental health and 
an apparent age-by-cohort interaction effect whereby depression declined 
with age more rapidly for earlier cohorts (Kasen et al. 2003; Yang 2007). Just 
as the cohort perspective enhances the understanding of the aging pro- 
cess, the aging perspective also sheds light on the mechanism underlying 
cohort effects. That is, cohort differences in social propensities and biological 
capacities are affected not only by early life but also lifelong accumulation 
of exposures. This conceptualization of cohort effects has been advanced by 
Ryder (1965) and referred to as the continuously accumulating cohort effects by 
Hobcraft, Menken, and Preston (1982). Hobcraft and colleagues further sug- 
gested the use of cohort analysis to capture the process by which the imprint 
of past events is differentiated by age and becomes embodied in cohorts dif- 
ferentially. We discuss this cohort theory in further detail and its mathemati- 
cal realization in the final chapter of this book. 


2.3 Distinguishing Age, Period, and Cohort 


Because the objective of cohort analysis is to identify the source of varia- 
tions attributable to birth cohort, it is imperative first to distinguish cohort 
from potential confounding factors, including age and period. To under- 
stand problems associated with inadequate data and study designs, we give 
an example in Table 2.1 that illustrates the relationship of birth cohort with 
chronological age and calendar year in any given dataset. In this example, 
data are arrayed by four ages (a60, a70, a80, and a90) for each of the four peri- 
ods p1980, p1990, p2000, and p2010. Birth cohort membership (c) is shown by 
diagonals within the age-by-period array and ranges between 1890 and 1950. 
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TABLE 2.1 


Hypothetical Data Arrayed by Age, 
Period and Cohort 


p1980  p1990 p2000 p2010 


a60 c1920_  c1930  c1940 c1950 
a70  c1910  Ác1920  c1930  c1940 
a80 | c1900  Ác1910  Ác1920  c1930 
a90 | c1890  c1900  Ác1910 c1920 


The problem with using cross-sectional data gathered at one point in time 
is the confounding of age and cohort changes. This can be seen by mov- 
ing down the rows within columns of Table 2.1. For example, in year 1980, 
changes across ages 60 to 90 are indistinguishable from changes across 
cohorts 1920 to 1890. If these changes are solely age related, then the same 
age pattern should hold in subsequent years 1990 to 2010, which subsume 
different sets of cohorts. If these changes are also a function of cohort dif- 
ferences, then the age pattern of 1980 should not resemble those of the other 
years containing different cohorts. Therefore, the detection of age and cohort 
effects can only be achieved when comparing data collected at different 
points in time. Similarly, comparing data from multiple time periods for the 
same age group precludes distinguishing between period and cohort effects. 
This kind of analysis is less common in practice, however. 

The consequence of using a cross-sectional design to infer age, period, or 
cohort effects can lead to erroneous inferences. For example, early epidemio- 
logic studies of age patterns of breast cancer incidence in the 1940s suggested 
a "break" in the age curve produced by a fall in the incidence between ages 
45 and 50 known as the "Clemmesen's hook" (Clemmesen 1948). While some 
later authors interpreted this as an age-related change due to hormonal or 
other physiological factors, MacMahon (1957) suggested that this break was 
an artifact produced by cross-sectional studies of age changes that actually 
were due to cohort changes. Comparing age-specific incidence data from 
female populations in Connecticut and England arrayed along the period 
and cohort lines, MacMahon found the break in the increasing age pattern of 
incidence rates arrayed by periods (MacMahon 1957: Figure 3) but no break 
during the menopausal age span for any cohort (MacMahon 1957: Figure 5). 
This, suggested by the study, is because changes in incidence between suc- 
cessive cohorts occurred in just those cohorts necessary to explain the obser- 
vation of the break at certain ages in particular time periods. 

The problem with using a single-cohort longitudinal panel is the con- 
founding of age and period effects. This can be seen by moving across cells 
within the cohort diagonals of the array shown. Suppose the cohort of inter- 
est is that born in 1920 first assessed in year 1980 at the age of 60. The cohort 
aged to 70, 80, and 90 at each subsequent 10-year follow-up in years 1990, 
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2000, and 2010, respectively. Therefore, within each cohort it is difficult to 
disentangle whether changes across the cells are age related, time period 
related, or both. It also is not possible to ascertain the extent to which the 
age trajectories identified for a single cohort have patterns shared with other 
cohorts as compared to being unique to the cohort studied. 

This confounding of age, period, and cohort effects in single-cohort lon- 
gitudinal studies is problematic whether the data studied are in the form of 
tables of population occurrence/exposure rates often studied by demogra- 
phers and epidemiologists or in the form of individual-level observations of 
the cohort members measured repeatedly over time as the cohort ages. As 
an illustration of the latter, consider the Hamil-Luker, Land, and Blau (2004) 
study of latent life course trajectories of cocaine use among a national repre- 
sentative sample of youths aged 14-16 initially surveyed in 1979 that was fol- 
lowed as a cohort with surveys in 1984, 1988, 1992, 1994, and 1998. Figure 2.1 
depicts three different trajectories of cocaine usage among the cohort of ado- 
lescents grouped into the "delinquents" latent cluster. One of these shows a 
clear peak of reported cocaine use in the 1980s when sample members were 
in their 20s. Members of this trajectory grouping then “age out" to lower 
rates of cocaine use in their 30s. A second cluster exhibits a steady decline 
of reported cocaine use from the late teens through the 20s into the 30s. The 
third trajectory consists of youths with comparatively low rates of reported 
cocaine use throughout the ages/time periods studied. 

While this study was the first to conduct such a thorough latent class and 
latent trajectory analysis of cocaine use across the adolescent and young 
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Delinquents’ predicted probability of cocaine usage by latent cluster membership, N = 244. 
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adult years, it could not distinguish age, period, and cohort components in 
the trajectories it identified and estimated. The reason is that the age and 
time dimensions on the horizontal axis of Figure 2.1 are identical to each 
other, and thus age/life course developmental aspects of the trajectories can- 
not be separated from those of the time periods of the follow-up surveys. 
In addition, because the study was based on a single cohort, the possible 
effects of different cohort socialization and life course experiences cannot be 
deciphered. Thus, while it may be the case that all recent cohorts of youths 
in the United States have similar latent clusters and latent trajectories of 
cocaine use, the extent to which the trajectories were affected by the societal 
milieu of cocaine experimentation and use in the 1980s and early 1990s can- 
not be ascertained. 


2.4 Summary 


In empirical investigations confined to changes in age and/or period trends, 
the cohort effect frequently is ignored or assumed nonexistent. The assump- 
tion of no cohort effects can greatly simplify analysis, but its tenability can 
be called into question when such effects are present. 

Cohort analysis aims at distinguishing age, period, and cohort effects and 
is theoretically important in three ways. First, it is crucial for attributions 
of etiology or social causation. Conceptually, age effects represent aging- 
related developmental changes within individuals, whereas temporal trends 
across time periods reflect exogenous changes in social and epidemiologic 
conditions. And, cohort changes reflect the intersection of both as a result 
of differential accumulation of lifetime exposures to environmental condi- 
tions. Second, cohort analysis also relates to the generalizability of research 
findings. In the absence of period and cohort effects, age changes are broadly 
applicable across individuals in different time periods and/or cohorts. The 
presence of either or both of these effects, however, indicates the existence 
of exogenous forces or exposures that are period and cohort specific. Third, 
to the extent that these effects serve as aggregates and proxies for differ- 
ent sets of structural correlates, analyses that allow for their distinction are 
especially valuable for better understanding and identifying the underlying 
social and environmental factors that are amenable to modifications. 

In sum, because cohort analysis has the capacity to depict the entire com- 
plex of social, historical, and environmental factors that shape individual life 
courses parsimoniously, its importance for constructing and refining theory, 
measurement, and analysis can hardly be overstated. In spite of these theo- 
retical merits and conceptual relevance, empirical cohort analysis has been 
hampered by data limitations and methodological challenges. We now turn 
to these issues in the chapters that follow. 
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APC Analysis of Data from Three 
Common Research Designs 


3.1 Introduction 


As previously indicated, the goal of age-period-cohort (APC) analysis is to 
distinguish and statistically estimate the unique effects associated with age, 
period, and cohort. The extent to which this goal can be realized depends on 
research designs and modeling strategies. In this chapter, we focus on three 
research designs that allow analysts to distinguish age, period, and cohort 
effects and thus are more suitable for APC analysis than cross-sectional or 
single-cohort panel designs. We identify two to three prototypical datasets 
that characterize the application of APC analysis in each of three designs. In 
each case, we commence with statements of substantive problems that arise 
from the scientific literature and then describe data that will be analyzed 
in subsequent chapters to address these problems. We draw examples on 
a wide variety of topics throughout the book but focus on aging, longevity, 
and health disparities, as these are problems that have long histories of tem- 
poral analysis and for which APC analysis is highly salient. 


3.2 Repeated Cross-Sectional Data Designs 


We illustrated in Chapter 2 how age and cohort changes are confounded in 
cross-sectional data collected at one point in time and therefore do not per- 
mit cohort analysis. Cross-sectional data collected repeatedly across time, 
however, are well suited for APC analysis. Pooling data of all years, one can 
formulate a rectangular age-by-period array of observations, where columns 
correspond to age-specific observations in each year, and rows are obser- 
vations from each age across years. Linking the diagonal cells of the array 
yields the observations belonging to people born during the same calendar 
years who age together. 
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Period 


FIGURE 3.1 
Age-by-time period data structure. 


A generic form of the age-by-time period array is shown in Figure 3.1. In 
this rectangular array, age-specific observations such as mortality rates or 
proportions of a population with a certain attribute (e.g., having a "very 
good" or "excellent" self-reported health status) are summarized in an 
age-by-period table for age groups and time periods that are of equal 
interval lengths (e.g., 5 or 10 years). The diagonal elements of the matrix 
correspond to observations of birth cohorts. One cohort is indicated by 
the shaded area bounded by the dashed arrows. A numerical example 
of the relationship between A, P, and C in this kind of data structure, 
namely, C = P - A, has been shown previously in Chapter 2. Although 
only a longitudinal panel study design provides data from true birth 
cohorts that follow identical individuals over time, the design illustrated 
in Table 3.1, if based on complete population data or repeated represen- 
tative sample surveys thereof, allows for a classic demographic analysis 
using the synthetic cohort approach (Mason and Fienberg 1985; Preston, 
Heuveline, and Guillot 2001) that traces essentially the same groups of 
people from the same birth cohorts over a large segment of the life span. 
The composition of cohorts in this setting may be affected somewhat by 
international migration. 
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TABLE 3.1 
Age-Specific Lung Cancer Incidence Rates (per 100,000 Population) for White 
Males and White Females, United States 1973-2008 
White Males 
Period 
Age 1973-74 1975-79 1980-84 1985-89 1990-94 1995-99 2000-04 2005-08 
20-24 0.39 0.22 0.22 0.27 0.20 0.14 0.27 0.33 
25-29 0.67 0.52 0.43 0.23 0.38 0.35 0.58 0.51 
30-34 2.15 2.04 1.58 1.68 1.35 1.08 1.07 1.01 
35-39 7.61 7.43 6.64 5.28 4.37 3.46 3.42 2.63 
40-44 23.33 20.83 19.58 16.93 11.89 10.27 10.80 8.71 
45-49 56.24 60.38 51.09 43.76 33.59 26.09 26.39 22.10 
50-54 104.38 115.14 111.54 103.73 78.85 62.54 57.13 50.63 
55-59 182.69 187.61 201.56 194.98 154.78 130.15 118.75 99.07 
60-64 272.44 298.26 301.73 311.32 280.51 235.78 227.68 187.87 
65-69 383.62 407.04 435.12 418.41 419.10 371.68 358.45 317.09 
70-74 470.01 501.01 538.41 540.25 503.63 483.21 487.61 434.58 
75-79 485.08 542.78 586.05 588.58 572.95 537.63 569.73 541.55 
80-84 403.33 477.56 551.72 580.48 589.73 549.94 556.07 561.49 
85+ 324.35 375.34 424.38 457.22 473.46 455.13 480.50 472.68 
White Females 
Period 
Age 1973-74 1975-79 1980-84 1985-89 1990-94 1995-99 2000-04 2005-08 
20-24 0.23 0.15 0.22 0.27 0.35 0.31 0.31 0.34 
25-29 0.68 0.61 0.53 0.62 0.50 0.43 0.59 0.65 
30-34 1.75 1.77 1.20 1.23 1.35 1.14 1.18 1.37 
35-39 5.66 5.89 5.67 4.23 3.60 4.22 4.07 3.07 
40-44 13.86 16.56 16.48 12.75 10.81 9.03 11.65 9.08 
45-49 26.63 34.08 33.33 33.82 27.67 23.09 23.89 25.01 
50-54 40.64 52.43 65.78 66.66 62.08 51.71 46.67 44.81 
55-59 63.75 81.27 98.54 115.71 113.78 102.12 96.61 78.07 
60—64 76.88 105.93 135.84 164.30 172.65 166.89 170.44 149.49 
65-69 84.30 115.08 165.17 200.91 234.38 248.80 254.54 249.04 
70-74 72.64 110.21 167.20 230.46 273.19 307.13 326.30 327.83 
75-79 71.32 96.72 142.09 210.70 282.92 315.38 353.70 375.94 
80-84 64.87 76.25 120.73 168.55 236.59 292.49 322.44 350.92 
85+ 59.66 74.33 91.22 118.60 146.34 188.93 227.06 249.75 
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Repeated cross-sectional data designs have largely been analyzed as age- 
by-time period arrays in which age, time period, and birth cohort are con- 
sidered same-level factors affecting the outcome of interest. The subsequent 
modeling approach suffers from a major problem, called the “model identi- 
fication problem” or APC conundrum, induced by the algebraic relationship 
between A, P, and C noted. Different temporal groupings for the A, P, and 
C dimensions can be used to break the linear dependency. For example, one 
can use single years of age, time periods corresponding to years in which 
vital statistics are recorded or surveys are conducted (which may be sev- 
eral years apart) and cohorts defined by either 5- or 10-year intervals that 
are conventional in demography or application of substantive classifications 
(e.g., war babies, baby boomers, baby busters, etc.). We defer the discussion of 
limitations of using this approach in conventional linear regression models 
to Chapter 4. Here, we simply note that, when the data used to construct the 
age-by-time period data structure of Figure 3.1 are in such a form that differ- 
ent temporal intervals can be used for the age, time period, and cohort group- 
ings, the same design can be considered alternatively as a multilevel data 
design whose structure is illustrated in Figure 3.2; each row is a cohort, and 
each column is a time period. Note that data balanced in age-by-time period 
arrays (as shown in Figure 3.1) are necessarily unbalanced in cohort-by-time 
period arrays. In this design, individual-level age-specific observations are 
nested in and cross-classified simultaneously by two higher-level social con- 
texts defined by time period and birth cohort. This multilevel data struc- 
ture then motivates the employment of a different modeling approach (the 


Period 


Cohort 


FIGURE 3.2 
Cohort by period cross-classification data structure. 
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generalized linear mixed model [GLMM] approach described in Chapter 4) 
that avoids the model identification problem of classical linear APC models. 


3.3 Research Design I: Age-by-Time Period 
Tabular Array of Rates/Proportions 


In demographic and epidemiologic investigations, researchers typically are 
interested in aggregate population-level or tabular data such as rates of mor- 
bidity, disability, and mortality. As shown in Figure 3.1, the data structure is 
one in which rates or proportions are arranged in rectangular arrays with 
age intervals defining the rows and time periods defining the columns. The 
use of this research design is most widely seen in chronic disease epidemiol- 
ogy and demography. Our empirical analyses focus on the case of malignant 
neoplasm. Cancer has taken the place of heart disease to become the num- 
ber 1 killer of Americans under age 65 (Jemal et al. 2005). Since President 
Richard Nixon signed the National Cancer Act in December 1971, broaden- 
ing the scope and responsibilities of the National Cancer Institute (NCI), the 
nation has engaged itself in 40 years of the "War on Cancer" (Marshall 2011). 
Where do we stand in new cases of malignancies today compared to the ear- 
lier decades? And, how do we fare in survival and longevity in the face of the 
most invincible maladies known to humankind? As both direct and indirect 
indicators of the influences of myriad genetic, social behavioral, economic, 
and environmental factors on population health, the age patterns, temporal 
trends, and birth cohort variation of cancer incidence and mortality rates 
become crucial for understanding the longevity prospect of human beings 
(Manton, Akushevich, and Kravchenko 2009). 


3.3.1 Understanding Cancer Incidence and Mortality Using APC 
Analysis: Biodemography, Social Disparities, and Forecasting 


Although there is a vast literature on cancer morbidity and mortality, several 
questions remain that beg the utilization of APC analysis. First, what is the 
relationship between cancer and aging? Previous studies showed declines in 
tumor progression rates and cancer mortality with advanced age (Frank 2007; 
Manton, Akushevich, and Kravchenko 2009). While this may be contributed 
in part by selective survival that decreases population heterogeneity in older 
ages (Vaupel, Manton, and Stallard 1979), there also are biological explana- 
tions suggesting that the processes of cancer development and senescence 
might interact and be related (Pompei and Wilson 2001; Arbeev et al. 2005). 
It is not clear, however, whether or to what extent the observed declines with 
age are due to cohort dynamics that reflect differences in lifelong exposures 
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to carcinogens and other risk factors unrelated to developmental changes. 
It also remains unknown whether the age declines persist or change across 
historical time. Therefore, APC analysis should be employed to better tease 
out the possible causes of this decline and inform the underlying biological 
processes of aging. 

The second question builds on the first one and concerns social dispari- 
ties in the age patterns and temporal dynamics. For instance, cancer inci- 
dence rates are higher for males than females in older ages (Arbeev et al. 
2005), and the male excess in cancer mortality rates increases with age (Yang 
and Kozloski 2012). In addition to biological differences between men and 
women, the age changes in the sex gaps may be due to social behavioral 
factors such as cigarette smoking that have shown apparent birth cohort 
variations (Pampel 2005). Similarly, racial differences in cancer incidence 
and mortality trends can also be attributed to changes related to age, 
birth cohort, or period components. Tobacco use has been one of the best- 
established behavioral risk factors for cancer and many other degenerative 
diseases (Manton, Akushevich, and Kravchenko 2009). There is evidence 
of large cohort increases in cigarette smoking that initiated, peaked, and 
decelerated in a gender-specific fashion, with women lagging behind men 
by about 20 years (Wang and Preston 2009). There is also evidence that obe- 
sity, a major biobehavioral risk factor for cancer and other chronic diseases, 
shows large increases over time and in more recent cohorts born after 1955, 
and such period and cohort changes are most pronounced in black females 
(Reither, Hauser, and Yang 2009). How differential exposures and vulner- 
abilities to these risk factors lead to sex and racial disparities in trends and 
patterns of cancer morbidity and mortality merits a systematic APC analysis. 

Third, what do current trends in cancer mortality tell us about the future? 
Conventional demographic forecasting techniques make use of only age and 
period variations to project mortality into the future and ignore birth cohort 
variation. Under the condition of no substantial cohort changes in mortality, 
this omission has no serious consequences. However, the accuracy of the 
age-period-based forecast may be significantly decreased when children and 
young adults alive today experience different epidemiologic conditions as 
they age and hence different mortality rates from their predecessors. The 
obesity epidemic is one example of how health conditions across cohorts can 
change quite rapidly and dramatically, and sometimes for the worse. While 
the idea of using APC models for mortality forecasting has been suggested 
by Osmond (1985) and hence is not new, the ever-changing cohort patterns 
of myriad health indicators today call for revisiting the forecasting method 
and its implementation. 

We address these questions by examining data on U.S. cancer incidence 
and mortality rates for multiple sites to reveal their distinct clinical and 
public health characteristics and different implications for socioeconomic 
changes during the years 1973 to 2008. 
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3.3.2 Cancer Incidence Rates from Surveillance, Epidemiology, 
and End Results (SEER): 1973-2008 


The NCIS Surveillance, Epidemiology, and End Results (SEER) Program col- 
lects information through cancer registries in various states on newly diag- 
nosed invasive cancers in the United States. The population coverage of the 
SEER Program was 9.5% of the U.S. population from 1973 to 1991, increased 
to 13.8% from 1992 to 1999, and to 26.2% since 2000. We obtained incidence 
cancer and population count data from 1973 to 2008 by age, year of diagnosis, 
sex, and race from the SEER research data using the SEER*Stat Software. We 
analyzed the trends of the top 20 cancer sites for males and females (Jemal 
et al. 2008): lung and bronchus, prostate, breast, colon and rectum, pan- 
creas, ovary, leukemia, non-Hodgkin lymphoma, uterine corpus, esophagus, 
urinary bladder, liver and intrahepatic bile duct, kidney and renal pelvis, 
stomach, brain and other nervous system, myeloma, uterine cervix, thyroid, 
oral cavity and pharynx, and melanoma of the skin. Site and histology were 
coded according to the International Classification of Diseases for Oncology 
(ICD-O) edition in use at the time of diagnosis and converted to the third 
edition coding. The SEER site recode is provided at http://seer.cancer.gov/ 
siterecode/icdo3_d01272003/. Cases of cancer incidence were associated with 
the population estimates using racial and ethnic groups that changed over 
the period. These groups included white, black, and other (American Indian/ 
Alaska Native and Asian/Pacific Islander combined) for the period before 
1992 and include an additional Hispanic origin after 1992. For analyses of 
long-term trends, we used former data for the three major racial and ethnic 
populations. More details on population estimates used for the calculation 
of cancer incidence are provided at http://seer.canc er.gov/data/index.html. 

Table 3.1 shows the age-specific lung cancer incidence rates for white males 
and females for ages 20 to 85+ and time periods from 1973 to 2008. Birth 
cohort-specific rates lie along the diagonals of the age-by-time period rectan- 
gular array. There are a = 14 five-year age groups, p = 8 five-year periods, and 
c = 21 ten-year birth cohorts that range from those born before 1885 to those 
born in 1985 (marked by the midinterval birth year). This rate table is repre- 
sentative of data widely used in population studies and all cancer incidence 
rate data used in subsequent analyses. 


3.3.3 Cancer Mortality Rates from the National Center 
for Health Statistics (NCHS): 1969-2007 


We obtained the U.S. cancer mortality data collected by the National Center 
for Health Statistics (NCHS) using the SEER*Stat software for the period 
1969 to 2008 by sex and race. The International Classification of Diseases (ICD) 
versions changed over this time period (ICD-8 to ICD-10). The underlying 
causes of death are based on the death certificate information reported to the 
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Centers for Disease Control and Prevention (CDC) National Vital Statistics 
and categorized according to SEER site groups to ensure comparability among 
ICD versions (Jemal et al. 2008). The SEER cause of death recode is listed 
at http://seer.cancer.g ov/codrecode/19 69+_d09172004/. Denominators in 
the death rate computation are from county-level population estimates that 
were summed to the state and national levels. More detailed descriptions of 
the methodologies used for the population estimates by the Census Bureau 
and the NCI are available elsewhere (Jemal et al. 2008). Similar to cancer 
incidence data, cancer mortality (or other cause-specific mortality) data from 
the NCHS were not consistently recorded for every racial and ethnic group 
for all periods. To facilitate comparison with the incidence data across time, 
we analyzed the trends for the three major racial groups: white, black, and 
other. All cancer mortality data can be accessed at the SEER website: http:// 
seer.cancer.gov/mortality/index.html. 

Table 3.2 presents the age-specific lung cancer death rates for white males 
and females for ages 20 to 85+ and time periods from 1969 to 2007. Similar to 
the incidence rate data, there are a = 14 five-year age groups, p = 8 five-year 
periods, and c = 21 ten-year birth cohorts that range from those born before 
1885 to those born in 1985 (marked by the midinterval birth year). Data on 
rates for other sites by sex and race take the same form and are presented 
graphically in subsequent analyses. 

The examples given take the form of an age-by-period array using aggre- 
gate age and period groups in equal intervals (5 years). We can also utilize 
differential temporal groupings of the age, year, and cohort variables to 
create a multilevel data array of rates. Using the lung cancer mortality rate 
data as an example, we show the alternative data structure in Table 3.3; 
number of observations and mean are presented by cohort and period. 
Because the SEER website mentioned provides single-year age-specific 
data on cancer incidence rates but not mortality counts, we obtained the 
single-year age-specific data on mortality for a slightly shorter time period 
until 2002 from the National Bureau of Economic Research (NBER) website. 
We used data on national death certificate reports from the NCHS mul- 
tiple causes of death file to obtain the number of cancer deaths. The cause 
of death was determined using ICD-8 for 1969-1978, ICD-9 for 1979-1998, 
and ICD-10 for 1999 and later. The population estimates for 86 single-year 
age groups (0, 1, 2, ..., 84, 85+) are from the SEER U.S. population data as 
introduced previously. For each 5-year birth cohort in a 5-year time period, 
there are multiple single-year age-specific lung cancer death rates that can 
also be sex and race specific. We used data on black and white popula- 
tions only for the multilevel analysis due to the poor quality of data on the 
other race group and lack of information on other covariates for this group. 
In this case, there can be up to 100 observations within each cohort-by- 
period cell. Note that birth cohorts defined in the previous age-by-period 
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TABLE 3.2 


Age-Specific Lung Cancer Death Rates (per 100,000 Population) for White Males 
and White Females, United States 1969—2007 


White Males 
Period 
Age 1969-74 1975-79 1980-84 1985-89 1990-94 1995-99 2000-04 2005-07 
20-24 0.17 0.11 0.12 0.08 0.10 0.11 0.11 0.07 
25-29 0.38 0.35 0.30 0.27 0.23 0.22 0.21 0.20 
30-34 1.82 1.48 1.05 1.11 0.98 0.97 0.63 0.55 
35-39 7.37 5.84 4.87 3.81 3.65 3.03 2.55 1.95 
40-44 20.50 18.34 15.78 13.29 10.70 9.78 9.06 7.27 
45-49 45.87 46.68 42.95 37.44 30.57 23.70 22.38 20.17 
50-54 84.48 93.08 93.12 85.59 73.28 57.86 47.75 44.39 
55-59 149.33 155.63 165.20 164.97 148.33 121.48 101.36 84.08 


60-64 231.90 249.30 252.11 265.38 257.32 217.92 187.06 162.94 
65-69 315.83 345.18 361.64 363.77 373.21 339.60 299.91 269.95 
70-74 367.93 425.23 460.48 476.62 466.60 456.26 421.13 383.22 
75-79 382.20 457.91 516.57 545.13 546.39 521.67 511.77 489.76 
80-84 328.01 430.51 501.16 562.57 590.58 568.68 541.83 541.89 
85+ 240.67 325.83 408.14 466.82 522.39 535.45 493.98 464.79 


White Females 
Period 
Age 1969-74 1975-79 1980-84 1985-89 1990-94 1995-99 2000-04 2005-07 
20-24 0.09 0.08 0.05 0.06 0.06 0.03 0.04 0.08 
25-29 0.23 0.57 0.18 0.21 0.21 0.21 0.16 0.18 
30-34 1.01 2.61 0.79 0.77 0.83 0.83 0.61 0.44 
35-39 3.39 11.77 3.34 2.66 2.71 2.91 2.60 1.87 
40-44 8.45 41.49 9.59 8.72 7.20 7.35 8.04 7.34 
45-49 17.19 97.72 23.63 23.00 20.66 16.46 16.77 17.74 
50-54 27.27 180.81 43.64 47.17 44.99 38.82 32.42 31.30 
55-59 39.01 270.18 68.31 78.57 82.34 75.24 67.29 57.06 


60-64 45.45 343.71 93.42 116.05 129.06 127.23 121.26 110.17 
65-69 49.61 400.83 117.06 148.40 175.60 183.74 183.07 174.83 
70-74 51.37 431.75 123.19 172.18 211.65 235.27 244.42 241.36 
75-79 55.14 385.16 113.02 165.96 223.94 254.68 280.02 289.41 
80-84 56.36 377.58 102.21 143.53 202.69 250.54 279.78 289.47 
85+ 57.88 270.41 92.96 115.62 154.57 193.55 220.49 231.68 
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contingency tables are 10-year cohorts that are labeled by the midinterval 
birth year and overlap with adjacent neighbors. The cohorts in this table 
are actual 5-year groups defined by birth years and mutually exclusive. In 
sum, we can cross classify rates by birth cohort and time period and exam- 
ine age-specific data that can be considered as nested in the cohort-by-time 
period contexts. 


3.4 Research Design II: Repeated Cross-Sectional 
Sample Surveys 


The previous section focused on the possibilities for APC analysis using 
repeated cross-sectional data at the population level. In this section, we show 
additional utilities of such design using microdata from repeated cross- 
sectional sample surveys that are increasingly available to social scientists. 
Compared to the longitudinal panel design, which usually spans a short 
time period, the synthetic cohort approach has the advantage of facilitating 
simultaneous tests of age and period effects because it is based on represen- 
tative national surveys of all ages conducted regularly from one period to the 
next and often covering multiple decades. It suffers less from the difficulty in 
locating sample respondents across time in panel studies, although it is not 
exempt from attrition due to mortality. Pooling data of all ages and survey 
years yields the age-by-time period data structure as shown in Figure 3.1. 
And, arranging data by all birth cohorts and survey years yields the same 
cohort-by-period cross-classification data structure as shown in Figure 3.2. 
In this case, individual respondents (usually not the same individuals in 
more than one survey) of any birth cohort are interviewed in multiple rep- 
lications of the survey, and individual respondents in any particular wave 
of the survey can be drawn from multiple birth cohorts. Although sample 
surveys share these characteristics with tabular data at the aggregate pop- 
ulation level, they possess an additional and important feature. That is, 
repeated cross-sectional surveys provide individual-level data on both the 
responses and a wide range of covariates, which can be employed for much 
finer-grained analyses of explanatory hypotheses. 


3.4.1 General Social Survey (GSS) 1972-2006: Verbal Test Score 
and Subjective Well-Being 


We first used data from General Social Survey (GSS) conducted over three 
decades for detailed methodological expositions. The GSS are a typical 
example of repeated cross-sectional surveys that have monitored the atti- 
tudes and behaviors of adults in the United States since 1972 (Davis, Smith, 
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and Marsden 2005). Each survey uses multistage stratified probability sam- 
pling and includes a nationally representative sample of noninstitution- 
alized adults age 18 and older in the United States. We focused on two 
substantive problems that have attracted attention in previous studies using 
APC analysis. 

The first problem concerns controversy over trends in verbal ability. A 
series of articles published in the American Sociological Review in 1999 cen- 
tered on the existence of an intercohort decline in verbal ability in the GSS 
1974 to 1996. The debate was initiated by Alwin (1991) and Glenns (1994) 
finding of a long-term intercohort decline in verbal ability beginning in the 
early part of the twentieth century. Wilson and Gove (1999) took issue with 
this finding and argued that the Alwin and Glenn analyses confused cohort 
effects with aging effects and ignored time period effects. In response, Glenn 
(1999) disagreed that the decline in GSS vocabulary scores resulted solely 
from period influences and argued against the Wilson and Gove claim that 
cohort differences actually reflected only age effects. The previous findings 
on trends in verbal scores are interesting and suggestive. But, until age, 
period, and cohort effects are simultaneously estimated, the question of 
whether the trends are due to age, period, or cohort components remains 
incompletely resolved. We used this specific example to motivate the statisti- 
cal methodology we present in Chapter 6. The substantive results are there- 
fore presented for illustration. 

Table 3.4 summarizes the descriptive statistics of the verbal test score data 
we analyzed. This is an extension of the 1974-1996 data on which the con- 
troversy was based and includes 17 cross sections of the GSS: 1974, 1976, 
1978, 1982, 1984, 1987, 1988, 1989, 1990, 1991, 1993, 1994, 1996, 1998, 2000, 2004, 
and 2006. In these surveys, a survey respondent's vocabulary knowledge 


TABLE 3.4 
Summary Statistics for GSS Vocabulary Test Data, 1974—2006 
Description and Coding N Mean SD Min Max 
Dependent Variable 
WORDSUM A 10-item composite 22,042 6.05 2.13 0 10 


vocabulary scale score 


Level 1 Variables 


AGE Age at survey year 22,042 45.13 1737 18 89 

EDUCATION Highest levels of education 22,042 12.84 3.02 0 20 
respondent completed 

SEX 1 = female; 0 = male 22,042 0.57 0.5 0 1 

RACE 1 = black; 0 = white 22,042 0.15 0.36 0 1 

Level 2 Variables 

COHORT Five-year birth cohorts 20 = — -1894 1985-89 


PERIOD Survey years 17 = — 1974 2006 
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is measured by a composite scale score named WORDSUM, which is con- 
structed by adding the correct answers to 10 verbal test questions and ranges 
from 0 to 10. WORDSUM has an empirical frequency distribution that is 
approximately bell shaped with a mean of about 6 and is reported in previ- 
ous studies to have an internal reliability of .71 (Wilson and Gove 1999:258)." 
The data include 22,042 respondents who had measures on WORDSUM 
and other covariates across all survey years. Respondents' ages in the data 
pooled across all surveys varied from 18 to 89. The average years of education 
completed was around 12.8 years. Fifty-seven percent of respondents were 
female, and 1576 were black. There were 20 five-year birth cohorts. The oldest 
cohort member was born before 1894 and the youngest born in 1985-1989. 
As has been introduced previously and will be clearer in chapters to follow, 
gender, race, and education are level 1 covariates, and survey years and birth 
cohorts are level 2 contextual variables in hierarchical model specifications. 
Table 3.5 further illustrates the two-way cross-classification structure of the 
verbal ability data in terms of mean verbal test scores of individual respon- 
dents in cohort by period cells. 

A second problem of interest to APC analysts that can be studied with GSS 
data is trends in social inequalities of subjective well-being. As increases in 
life expectancies in the United States continue into the twenty-first century, 
there has been a growing need for research and policy to take into account 
both the quantity and the quality of life. A fundamentally important ques- 
tion for research communities, policy makers, and public authorities is, Are 
Americans living better as well as longer lives? The measure of subjective 
quality of life that has been examined most frequently is general happiness. 
Previous analyses of correlates of happiness have mostly been concerned 
with cross-sectional individual-level characteristics and attainments. We 
knew relatively little beyond the stratification of subjective quality of life at 
a static point in time. Specifically, there is little knowledge of the social het- 
erogeneity of life course patterns, time trends, and birth cohort differences 
in general happiness. And, still less is known about individual- and macro- 
level mechanisms underlying these patterns and trends. We use this as an 
example to show how a systematic APC analysis can reveal the various ways 
in which social stratification operates over the life course and historical time, 
test the extent to which aging and life course theories apply to subjective 
well-being, and shed new light on the changes in the social distribution of 
quality of life in the United States. 

Table 3.6 summarizes the descriptive statistics of the happiness data 
we analyzed. Happiness was assessed as a single-item scale reported from 
respondents. The data on happiness are available annually in 22 survey years 


` In an item analysis of individual words in WORDSUM, Alwin (1991: 628) found that some 
of the words have become more difficult over time. The general conclusion in this series of 
articles (Alwin and McCammon 1999; Glenn 1994; Wilson and Gove 1999), however, was that 
word obsolescence does not account for observed changes in the test scores over time. 
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from 1972 to 1994 (except for 1979, 1981, and 1992) and biannually from 1994 
to 2004. In all years, the GSS item on overall happiness was the following: 
“Taken all together, how would you say things are these days—would you 
say that you are very happy, pretty happy, or not too happy?” The responses 
were coded as 1 (= very happy), 2 (= pretty happy), and 3 (= not too happy). 
Evidence of the psychometric adequacy of this measure has been provided 
elsewhere (Yang 2008). The data include 28,869 respondents who had mea- 
sures on HAPPY and other covariates across all survey years. Respondents’ 
ages ranged from 18 to 89. There were 18 five-year cohorts born between 
1899 and 1986. The operational definitions and descriptive statistics of level 1 
and level 2 covariates used in the analysis are reported in the table. In addi- 
tion to period and cohort, three additional covariates at level 2 are included. 
The cohort-level covariate is relative cohort size (RCS) adopted from O'Brien 
(2000). Two period-level covariates are annual gross domestic product (GDP) 
per capita in thousands published by the U.S. Department of Commerce and 
the annual unemployment rate published by the U.S. Department of Labor, 
Bureau of Labor Statistics. 


3.4.2 National Health and Nutrition Examination Surveys (NHANES) 
1971-2008: The Obesity Epidemic 


Studies of the U.S. population have consistently shown a disconcerting 
increase in obesity rates in recent decades (Flegal et al. 2002; Freedman et al. 
2002; Mokdad et al. 2001; Ogden et al. 2002). According to estimates from the 
National Health and Nutrition Examination Survey (NHANES), the preva- 
lence of obesity among U.S. adults more than doubled in the last 20 years 
of the twentieth century (Flegal et al. 2002). Although specific explanations 
vary, the vast majority share the common view that secular changes in U.S. 
society (i.e., period effects) lie at the root of the obesity epidemic. Since there 
has been insufficient research to disentangle period effects from age and 
birth cohort effects, the importance of secular change relative to cohort 
membership is yet to be examined analytically. In a study of 1.7 million par- 
ticipants in the 1976—2002 National Health Interview Survey (NHIS), Reither, 
Hauser, and Yang (2009) employed APC analysis to determine what role 
birth cohorts have played in the U.S. obesity epidemic. Results confirmed 
that period effects are principally responsible for the epidemic, but they 
also demonstrated that cohort membership is more influential than previ- 
ously assumed. For example, relative to the 1955—1959 birth cohort, the odds 
of class 2 obesity (body mass index [BMI] > 35) increased by 50% for the 
1980-1984 birth cohort. They concluded that secular change and birth cohort 
membership have independently contributed to elevated odds of obesity 
among recent cohorts of Americans. 

We built on this study to further examine the obesity epidemic and under- 
lying mechanisms related to distinct age, period, and cohort influences. 
We addressed two remaining questions. First, because the BMI is based on 
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self-reported weight and height in the NHIS, it potentially can deviate from 
actual BMI. Therefore, we use data on measured BMI from the NHANES 
to more directly compare results regarding period and cohort effects with 
estimates from the same data that first established the period trends in 
obesity (Flegal et al. 2002). Second, the Reither, Hauser, and Yang (2009) 
study estimated A, P, and C effects for each stratified sample by sex, race, 
and education. Although the results showed differences in these effects for 
men and women, black and white, and the college educated and less edu- 
cated, the differences were not assessed quantitatively. We formally tested 
the hypothesis of social disparities in trends of obesity by estimating sex, 
race, and educational differences in A, P, and C effects. We also accounted 
for other individual-level risk factors for obesity, such as family income, in 
the analysis. 

The NHANES, conducted by the NCHS, uses a multistage stratified 
sampling design and includes a representative sample of the noninstitu- 
tionalized U.S. population, with an oversample of older persons and minori- 
ties (CDC 2010). There are nine survey periods: NHANES I (1971-1975); 
NHANES II (1976-1980); NHANES III, phase 1 (1988-1991); NHANES III, 
phase 2 (1991-1994); 1999-2000; 2001-2002; 2003-2004; 2005-2006; and 2007- 
2008. The study includes respondents aged 25 to 74 who attended household 
interviews and clinical examinations between 1971 and 2008. We also exam- 
ined all available ages that are as young as 2, but only report results on adults 
above the age of 25 who were more likely to have completed education and 
for whom meaningful measures of income are available. Birth years were 
calculated using survey year (midinterval value) minus respondents’ age. 
This resulted in 17 five-year birth cohorts born between 1899 and 1982, with 
the earliest cohort spanning six years (1899-1904) and the most recent cohort 
spanning just three years (1980-1982). We excluded foreign-born respon- 
dents due to their lack of the same period and cohort experiences and expo- 
sures with the native respondents. We further excluded respondents who 
were pregnant when weighed. The final sample included 40,261 individuals. 
Obesity status was assessed using BMI calculated from examination data 
on measured height and weight. When examination data were missing, self- 
reported height and weight was used. For adult respondents in our sample, 
obesity was defined as BMI > 30.0 kg/m?“ Because data on Hispanic ethnicity 
were not available until 1988, we used the black-white race variable available 
for all survey years. For descriptive analysis, we used the sampling weight 
to adjust for the effects of survey design and nonresponse. Table 3.7 pres- 
ents weighted summary statistics of the obesity outcome; five individual-level 
covariates (age, sex, race, education level, and income quartiles); and level 2 
variables of time period and birth cohort. 


* We thank Dr. Whitney Robinson for assistance with data preparation, including providing 
data on the measured weight and height for nine survey periods, construction of the obesity 
variable across ages, as well as substantive expertise on the measures and trends of obesity. 
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TABLE 3.7 
Summary Statistics for NHANES Obesity Data, 1971-2008 
Description and Coding N Mean SD Min Max 
Dependent Variable 
OBESE Measured BMI categorized 40,261 0.30 0.46 0 1 
into obesity status: 
1 = obese (BMI > 30); 
0 2 not obese 
Level 1 Variables 40,261 
AGE Age in years; centered 40,261 46 15 25 74 
around grand mean 
SEX Respondent's sex: 1 = male; — 40,261 0.48 0.50 0 1 
0 = female 
RACE Respondent's race: 1 = 40,261 0.12 0.33 0 1 
black; 0 = white 
EDUCATION  Respondent's years of 40,261 0.50 0.50 0 1 
schooling (reference = 
12 years or less) 
SOME 1 = 13-15 years of 40,261 0.26 0.44 0 1 
COLLEGE schooling; 0 = otherwise 
COLLEGE 1 = 16 or more years of 40,261 0.24 0.43 0 1 
schooling; 0 = otherwise 
FAMILY Total family income 40,261 39,797.7 | 32,305.5 0 120,407.7 
INCOME adjusted to 2000 dollars 
(reference = 2nd and 3rd 
income quartiles) 
LOWER 1 = Lowest 25% in each 40,261 0.21 0.41 0 1 
QUARTILE wave; 0 = otherwise 
UPPER 1 = Highest 25% in each 40,261 0.28 0.45 0 1 
QUARTILE wave; 0 = otherwise 
Level 2 Variables 
COHORT Five-year birth cohort 17 = = 1899 1982 
PERIOD Survey year 9 = = 1971 2008 


3.4.3 National Health Interview Surveys (NHIS) 1984-2007: 
Health Disparities 


Social disparities in health or simply health disparities are a long-standing 
research problem in medical sociology and social epidemiology. As data 
from more large surveys accumulate over time and become available to ana- 
lysts, the question of trends and changes in health disparities also becomes 
more relevant. In addition to a large body of demographic and epidemio- 
logic research on age variation and temporal trends in health, research on 
health disparities across the life course, cohorts, and time periods is also rap- 
idly growing. We defer the review of research literature on changing health 
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disparities to Chapter 6. In general, there is evidence for increases in gender, 
race, and socioeconomic inequalities across the life course, birth cohorts, and 
time periods in the United States. Two questions remain. First, to what extent 
do increasing disparities in specific domains of social stratification contrib- 
ute to increasing overall health disparities (dispersion or variance of health 
outcomes)? Second, to what extent are disparities across ages, cohorts, and 
time periods independent of each other? 

A major limitation of prior research is that it has treated these three time 
dimensions separately. However, they can be intertwined. For example, an 
increase in health disparities across time periods may result from cohort 
replacement, in which cohorts with larger within-cohort health disparities 
succeed cohorts with smaller within-cohort health disparities, or an aging 
society, wherein the elderly, who usually have larger within-age health dis- 
parities than younger people, increase their proportionate share in the popu- 
lation structure, or from some combination of the two. Similarly, a widening 
health disparity with age may be confounded with period or cohort patterns. 
Some studies have tried to disentangle age and cohort patterns in health 
disparities and have found distinct age effects and cohort variations in 
mean levels of health and changing health disparities by education, income, 
gender, and race over life course and across birth cohorts (Chen, Yang, and 
Liu 2010; Lauderdale 2001; Lynch 2003; Yang and Lee 2009). Lynch (2003) 
also found each pattern was suppressed when the other one was ignored. 
However, there is a lack of simultaneous assessment of the effects of age, 
period, and cohort in health disparities. 

We addressed this gap in the literature using data on self-reported health. 
Self-rated health is a widely used measure of general health status that has 
been found to be highly predictive of mortality and strongly correlated 
with objective assessments of health, including physician diagnoses (Idler 
and Benyamini 1997). In fact, self-rated health is a good indicator of objec- 
tive health and subclinical illness and has been found in some studies to be 
more predictive of mortality among the elderly than physician assessments 
(Schoenfeld et al. 1994). Close relationships between self-rated health and 
objective health indicators also hold across population subgroups (Bosworth 
et al. 2001). Based on these and related findings regarding its robustness as a 
single, summary index of an individual's health status, we studied self-rated 
health as a health outcome variable. But, we also caution that health is not a 
singular condition, and findings in this chapter may not always generalize to 
health disparity trends associated with specific health outcomes. 

Our analysis was based on annual data from the NHIS for the 24-year 
period 1984 to 2007. The NHIS is a multistage probability sample survey of 
the civilian noninstitutionalized U.S. population conducted by the NCHS. 
NHIS collects health information for each member of a family or household 
sampled, as reported by one primary respondent. To reduce reporting/ 
measurement errors, we limited our analysis to the primary respondent. The 
sample size for men was about 16,670 each year (in total 16,670*24 = 400,080), 
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TABLE 3.8 

Summary Statistics for Self-Reported Health Data from NHIS, 1984-2007 
Description and Coding N Mean SD Min Max 

Dependent Variable 

HEALTH Respondent's self-rated health: 701,888 3.76 1.13 1 5 


1 = poor, 2 = fine, 3 = good, 
4 = very good, 5 = excellent 


Level 1 Variables 


SEX 1 = man, 0 = woman 701,888 0.57 0.49 0 1 
RACE 1 = white, 0 = other races 701,888 0.82 0.38 0 1 
AGE Respondent's age at survey year 701,888 46.55 17.30 18 85 
EDUCATION Respondent's years of schooling 701,888 12.65 3.17 0 18 
MARRIED 1 = married, 0 = others 701,888 0.55 0.50 0 1 
EMPLOYED 1 = employed, 0 = others 701,888 0.67 0.47 0 1 
INCOME Household income in thousands 701,888 4.58 2.76 0.07 10.41 
REDESIGN Effect of survey redesign 701,888 0 1 
1 1995-2007, 0 = 1984-1994 

Level 2 Variables 

COHORT Five-year birth cohorts 18 — — 1899 1985 
PERIOD Survey year 24 — — 1984 2007 


and for women was about 12,575 each year (in total 12,575*24 = 301,800). The 
outcome variable, self-rated health, has remained largely unchanged across 
periodic revisions of the NHIS questionnaires, which facilitates the analy- 
sis of trends. It has five response categories: poor, fair, good, very good, 
and excellent. Table 3.8 shows the summary statistics for self-rated health 
and individual-level demographic and social variables that have been linked 
to health in previous research. The respondents were aged 18 to 85 and 
belonged to 18 five-year cohorts born between 1899 and 1985. 

The sampling frame for the NHIS is redesigned every 10 years and was 
redesigned, during the period studied here, in 1995. Nonetheless, the fun- 
damental design of the 1995-2007 NHIS was similar to that of the 1985-1994 
NHIS. Three changes in the sampling design and weighting structure are 
notable. First, the number of primary sampling locations has increased from 
198 to 358 since 1995. Second, both black and Hispanic populations were 
oversampled in the 1995-2007 NHIS, while only blacks were oversampled 
in the 1985-1994 NHIS. Third, the weighting structure changed after 1996. 
These three changes potentially affect the sample variances after 1995 and 
1996. Therefore, we used the sample weights in all analyses to adjust for the 
multistage sampling design. We also created an indicator/dummy variable 
named "redesign" to adjust the regression model estimates for any effects of 
sampling design changes since 1995. 
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3.4.4 Birth Cohort and Time Period Covariates Related to Cancer Trends 


As mentioned in Section 3.3.1, an extension of the cancer trend analysis is 
to examine the role of two prominent biobehavioral risk factors for cancer 
incidence and mortality, cigarette smoking and obesity, in explaining the 
temporal trends in cancer incidence and mortality. Diagnostic and screen- 
ing techniques (such as colonoscopy, mammography, and prostate-specific 
antigen [PSA] screening) and treatment (such as hormonal replacement ther- 
apy [HRT]) changes have also been frequently associated with time period 
changes in cancer incidence. To facilitate more detailed statistical investiga- 
tions of cancer trends in relation to hypothesized period and cohort-level 
covariates, we constructed period and cohort data on smoking and obesity, 
and period data on HRT and mammography from national sample surveys 
and other published sources. 

Data on Americans' smoking behaviors came from the NHIS. Respondents 
in the surveys were first asked if they had smoked 100 cigarettes in their life- 
time. If they responded in the affirmative, they were then asked if they cur- 
rently smoked cigarettes. We adopted the cohort smoking variable used by 
Wang and Preston (2009) that was an estimate of the average number of years 
spent as a current smoker before the age of 40 based on the NHIS conducted 
between 1965 and 2005. Compared to single measures of smoking status, this 
measure indexes the past behavior and history of surviving cohort mem- 
bers and hence better captures the cumulative exposures to carcinogens that 
contribute to cohort differences. The estimates pertain to 5-year female and 
male birth cohorts from 1885 and 1984. We constructed period-specific rates 
of smoking based on three measures available in the NHIS: percentage of 
current smokers, percentage quit smoking (or smoking cessation), and per- 
centage ever smoked. Based on the original data that are available for most 
years from 1970 to 2007, we created 5-year sex- and race-specific (black and 
white) smoking rates for the same eight periods of the cancer data and inter- 
polated the data for the period 1980-1984 using the neighboring periods. We 
only present the prevalence of current smokers in Table 3.9 as it appeared to 
be the best period effect predictor among the three. 

Data on obesity came from the NHANES as described. We created rates 
of obesity as defined in Table 3.7 based on measured BMI by both cohort 
and period. The cohort-specific obesity rates were calculated as percentage 
of obese members in 5-year birth cohorts by sex and race (black and white). 
Because the NHANES studies were not conducted every year, we used data 
from all survey years to construct 5-year obesity rates that applied to approx- 
imately the same 5-year periods for the cancer data. We interpolated the rates 
for 1980-1984 and 1995-1999 using estimates from adjacent periods. 

Data on diagnostic and screening hardly exist for the entire period of the 
past 30 years on which the cancer trends analysis was based. We were able to 
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TABLE 3.9 


Age-Period-Cohort Analysis 


Major Risk Factors for Cancer Incidence and Mortality by Birth Cohort 


and Time Period 


Data Source Description and Coding Mean SD Min Max 
Cohort-Level Variables 
SMOKE National Health Average number of years 10.08 4.69 0.88 17.9 
Interview spent as a cigarette 
Surveys? smoker before age 40 
among men and women 
in different birth cohorts 
OBESITY National Health Percentage of respondents 23.88 956 53 473 
and Nutrition ` who are obese based on 
Examination measurements obtained 
Survey, from a physical 
1971-2008? examination (1 = obese) 
Period-Level Variables 
SMOKE National Health Percentage of respondents 31.37 8.99 17.94 53.33 
Interview who currently smoke 
Survey Public (1 = yes) 
Use Data File 
1970-2007% 4 
OBESITY National Health Percentage of respondents 26.36 10.27 12.2 52.8 
and Nutrition who are obese based on 
Examination measurements obtained 
Survey, from a physical 
1971-2008 examination (1 = yes) 
HRT Kaiser Percentage of women 9.14 6.26 2.64 19.63 
Permanente aged 45 year and older 
Northwest who were dispensed at 
(KPNW) least one prescription of 
Outpatient estrogen plus progestin 
Pharmacy in each period 
System (1 = dispensed) 
MAMMOGRAM National Health Percentage of women 3993 28.76 2.33 73.97 
Interview aged 40 years and older 
Survey Public who reported a 
Use Data File mammogram within the 
2000, 2005 past 2 years in each 


period (1 = reported 
mammogram) 


a Source: Data from Haidong Wang and Samuel H. Preston, Proceedings of the National Academy 
of Sciences of the United States of America 106, no. 2 (2009): 393-398; constructed by Burns et al., 
Cigarette smoking behavior in the United States,” in Changes in Cigarette-Related Disease Risk 
and Their Implications for Prevention and Control, edited by D. M. Burns, L. Garfinkel, J. M. 
Samet, Bethesda, MD: National Institutes of Health, 1998: 113-304; updated estimates sup- 
plied by David M. Burns, June 29, 2005. 

^ Sample includes white and black respondents over the age of 19. 


^ 


a 


Respondents first affirmed that they had smoked 100 cigarettes (i.e., 5 packs) in their lifetime. 
Sample includes white and black respondents over the age of 19. 
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find two covariates pertinent to breast cancer incidence based on studies pre- 
viously published on most years during this period. Using the data from the 
Kaiser Permanente Northwest (KPNW) Outpatient Pharmacy System, Glass 
et al. (2007) reported that breast cancer incidence showed a sharp decline fol- 
lowing reduced use of HRT in 2002-2003. And, it was documented that the 
rise and fall of HRT formulations containing estrogen were also followed by 
the increase and decrease in a hormonally related cancer in the 1970s (Jemal, 
Ward, and Thun 2007). The study by Glass et al. (2007) suggested similar over- 
all period trends in the percentage of women dispensed at least one prescrip- 
tion per year from two different hormone therapies: unopposed estrogens 
and estrogen plus progestin. Because the latter has shown a stronger associa- 
tion with the risk of breast cancer according to the American Cancer Society's 
report (http://www.cancer.org/Cancer/BreastCancer/DetailedGuide/breast- 
cancer-risk-factors), we constructed a period variable of HRT usage based 
on this therapy only. The original data are for two separate age categories: 
45—59 and 60 and older and only start from year 1988. We combined the 
two age groups' estimates using the total population size in each group in 
each period as the weights. We then imputed the usage data by a backward 
extrapolation to year 1970 based on the observation that the rates showed 
a strong log-linear increasing trend from 1988 to 1999. The final variable of 
period pattern of HRT usage indicates the percentage of women aged 45 and 
older who were dispensed at least one prescription of estrogen plus proges- 
tin in each of eight 5-year periods from 1970 to 2006. The second period-level 
variable for breast cancer analysis is the percentage of women aged 40 years 
and older who reported a mammogram within the past 2 years. The origi- 
nal estimates came from the study conducted by Smith et al. (2008) using 
the NHIS data 1987-2005. We conducted a similar backward extrapolation 
to year 1970 in log-linear scale for black and white women and grouped the 
period-specific percentages by 5 years. 


3.5 Research Design III: Prospective Cohort Panels 
and the Accelerated Longitudinal Design 


The former two designs using repeated cross-sectional data adopted a syn- 
thetic cohort approach. That is, there was no information about cross-time 
linkages within individuals, and each synthetic cohort contained individuals 
from different cohorts at each point in time. Inferences drawn from such 
designs therefore assume that synthetic cohorts mimic true cohorts, and 
changes over time across synthetic cohort members mimic the age trajec- 
tories of change within true cohorts. If the composition of cohorts does not 
change over time due to migration or other factors and sample sizes are large, 
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FIGURE 3.3 
Accelerated longitudinal panel design. 


these assumptions are generally met. However, longitudinal data obtained 
from the same persons followed over time are increasingly available. The 
primary advantage of longitudinal panel data designs is that they provide 
cross-time linkages within individuals and hence information pertaining to 
true birth cohorts. 

The simplest longitudinal design follows one birth cohort for a period of 
time. As pointed out in Chapter 1, this is insufficient for cohort analysis due to 
lack of data on different cohorts. An accelerated cohort design follows multiple 
cohorts forward over multiple points in time (Tonry, Ohlin, and Farrington 
1991) and is an important advance in aging and cohort research. This design 
allows a more rapid accumulation of information on age for multiple cohorts 
than does a single longitudinal cohort design and makes the cohort analysis 
possible. An example of this data structure is shown in Figure 3.3, where 
four birth cohorts were followed up at four time points. The columns repre- 
sent their ages at each measurement, and the rows represent trajectories of 
change for each cohort. This design is especially useful for testing aging- and 
cohort-related hypotheses because it allows the distinction between intrain- 
dividual change with age within cohorts and intercohort differences. 

We continue the examination of health disparities for empirical illustra- 
tions with a focus on cohort variations in social disparities in health and 
comorbidity over the life course. The process by which health deteriorates 
or is maintained as individuals age and its relationship to social status 
have attracted great attention in recent social and epidemiologic research. 
But, findings have been inconsistent regarding whether social disparities in 
health grow or diminish over the life course. One key issue that has con- 
tributed to such inconsistency is the confounding of aging and the cohort 
succession process. This is a long-standing problem that has hindered our 
ability to adequately test competing theories, such as the cumulative advan- 
tage, age-as-leveler, selective survival, and double-jeopardy hypotheses. 
Following Riley's lead (1987), we introduce an innovative component to 
this body of research—cohort variations in aging experiences—to resolve 
the inconsistency. An earlier study conducted by Yang (2007) using an 
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accelerated longitudinal panel dataset on depression discovered indepen- 
dent age and cohort effects: Birth cohorts with different formative experi- 
ences had distinct trajectories of change in mental health with age. This 
highlights the relevance of social historical context represented by cohort 
membership to health outcomes above and beyond socioeconomic and more 
proximate behavioral correlates. 

Building on this study, we further analyzed intercohort variations and 
intracohort disparities in health over the life course. Because health is 
quintessentially a multidimensional phenomenon, we expanded the health 
outcomes to include multiple indicators of physical and mental health. We 
also paid attention to its different manifestations by examining both indi- 
vidual and cumulative health indicators. We drew on studies published by 
Yang and Lee (2009, 2010) for empirical applications. And, we discuss cross- 
national health impacts of cohort change by comparing findings from these 
studies with a recent Chinese study (Chen, Yang, and Liu 2010). These stud- 
ies jointly provide strong tests of the proposition that considering the process 
of cohort change is important for the theory, measurement, and analysis of 
social inequalities in health over the life course. 


3.5.1 Americans’ Changing Lives (ACL) Study 1986-2002: 
Depression, Physical Disability, and Self-Rated Health 


The Americans’ Changing Lives (ACL) study is a long-term nationally repre- 
sentative longitudinal survey of the adult noninstitutionalized U.S. popula- 
tion (House et al. 2005). It uses an accelerated longitudinal design wherein an 
initial sample of 3,617 individuals from a broad array of ages (25 and older), 
and thus of multiple birth cohorts (before 1905 to 1964), were interviewed 
in 1986 and monitored with three follow-up surveys in 1989, 1994, and 
2001-2002. The analytic sample used by Yang and Lee (2009) consisted of all 
black and white respondents at the baseline (N = 3,497 in 1986) and at subse- 
quent waves for which their data on all variables were available: N = 2,780 in 
1989, N = 2,331 in 1994, and N = 1,566 in 2001. The fourth wave included fewer 
than 100 proxy interviews that were treated as self-reported interviews. The 
majority of attrition in subsequent waves is due to non-response and death. 
The number of nonrespondents was 557 at wave two, 476 at wave three, and 
607 at wave four; the number of those who died was 160 by wave two, which 
increased to 530 by wave three and increased to 1153 by the final wave. A 
small number of observations in the last two waves were excluded due to 
missing values for one or more covariates. Together, these yielded 10,174 
person-year observations. 

The ACL study uses a complex sampling design that oversamples blacks 
and adults over 60 years of age. A composite weight variable was developed 
for each wave to adjust sample distributions for the probabilities of selection 
within households, geographical and race group differences, differential non- 
response, and poststratification (to match the demographic distributions of the 
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known population as estimated by the 1985 U.S. census). All statistical analy- 
ses employed this composite sampling weight to produce unbiased estimates. 

We examined three specific health outcomes. Depressive symptoms were 
measured by a standardized index created by the ACL study from an 11-item 
Center for Epidemiologic Studies Depression Scale (CES-D) of self-reported 
symptoms of depression, such as feeling depressed, lonely, sad, and so on, 
the reliability and validity of which have been well established (Radloff 1977). 
Physical disability was indicated by level of difficulty in performing activities 
of daily living (ADL) and of instrumental ADL (IADL) such as being con- 
fined to bed or a chair and climbing a few flights of stairs. A summary index 
was created by the ACL to indicate an increasing level of functional dis- 
ability, with 1, 2, 3, and 4 equal to no, least severe, moderately severe, and 
most severe functional impairment, respectively. Similar to the NHIS data 
introduced previously, self-rated health was measured by a scale indicating 
perception of general health, with 1 = poor, 2 = fair, 3 = good, 4 = very good, 
and 5 = excellent. 

Age at baseline interview ranged from 25 to 95+. Ages above 95 were 
recoded as 95 to avoid erratic estimates due to extremely small sample sizes 
for these ages. Using age and the baseline year of 1986, we grouped respon- 
dents into seven 10-year birth cohorts. This operationalization of cohort 
grouping is conventional in demographic analysis and distinguishes cohorts 
in a way that is qualitatively meaningful: cohorts 0-6 refer to those born 
before 1905, young Progressives (1905-1914), Jazz Age Babies (1915-1924), 
Depression Kids (1925-1934), War Babies (1935-1944), and Baby Boomers 
(1945-1954, 1955-1964), respectively (Hughes and O'Rand 2004; Yang 2007). 
We experimented with different cohort groupings, such as 5-year intervals 
and unequal intervals corresponding to the unequal intervals between each 
survey, and found largely similar results. The current operationalization, 
however, was superior in terms of parsimony. At each follow-up, surviving 
respondents in each cohort aged together, yielding cohort-specific age tra- 
jectories. The data structure is exemplified in Table 3.10, where the weighted 
means and standard deviations of CES-D scores are presented for each 
cohort, pooled across all surveys by age. The mean CES-D scores tend to 
decrease with age within cohorts. This is a first rough look at the data, and 
the evidence does not support a developmental explanation because cohort 
and age may interact, and other risk factors are not controlled. The lower 
panel shows that the mean CES-D scores are generally higher in earlier 
cohorts at each wave. 

The analyses adjusted for other key social, demographic, and health 
behavior variables summarized in Table 3.11. Chronic illnesses are indicated 
by the number of chronic conditions, including arthritis, lung disease, 
hypertension, heart attack, diabetes, cancer, foot problems, stroke, broken 
bones, and urine beyond control. Based on medical guidelines, we recoded 
the continuous BMI score into categories of underweight («18.5), overweight 
(25 < 30), and obese (30+), with normal (18.5 < 25) being the reference group. 
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TABLE 3.11 
Summary Statistics of All Variables in the ACLs Analyses: 1986—2001/2 (N = 10,174) 
Description and Coding N Mean SD Min Max 

Dependent Variables 

CES-D Standardized score of 104174 -0.14 0.95  -1.18 4.74 
depressive symptoms 

DISABLE Disability: 1 = most severe; 10,174 1.29 0.73 1 4 
4 = no impairment 

HEALTH Self-rated health: 1 = poor; 10,174 3.61 1.03 1 5 


5 = excellent 


Level 1 Covariates 


AGE Respondent'sageatsurvey 10,174 5125 15.66 25 95 
year 
INCOME Family income: 2001 dollars 10,174 53.40 60.09 0 2750 
in thousands 
NOTMARRIED Marital status: 1 = not 10,174 0.31 0.46 0 1 
married; 0 = married 
ILLNESS Number of chronic 10,174 1.06 1.24 0 8 
conditions 
BMI Body mass index 
(reference — normal) 
UWEIGHT Underweight: BMI « 18.5 10,174 0.17 0.38 0 1 
OWEIGHT Overweight: 25 « BMI « 30 10,174 0.30 0.46 0 1 
OBESE BMI > 30 10,174 0.14 0.35 0 1 
SMOKE Number of cigarettes one 10,174 499 1022 0 50 


usually smokes in a day 


Level 2 Covariates 


COHORT 10-year birth cohort 10,174 = <= 0 6 

SEX Respondent's sex: 1 = female; 10,174 0.54 0.50 0 1 
0 = male 

RACE Respondent's race: 1 = black; 10,174 0.11 0.31 0 1 
0 = white 

EDUC Number of years respondent 10,174 12.62 2.89 0 17 
attended school 

DIED Attrition type at follow-up: 10,174 0.15 0.35 0 1 
1 = deceased; 0 = others 

NONRESPONSE Attrition type at follow-up: 10,174 0.18 0.38 0 1 


1 = nonresponse; 0 = others 


Respondent's birth cohort membership, sex, race, and education are time- 
constant or person-level covariates. All other variables are time varying or 
level 1 within-person covariates with measurements at four interviews. We 
also controlled for effects of attrition by including dummy variables indicat- 
ing the deceased and nonrespondents to distinguish those who were lost to 
follow-ups and those who had complete data from all survey waves. 
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3.5.2 Health and Retirement Survey (HRS) 1992-2008: Frailty Index 


The Health and Retirement Survey (HRS) is a nationally representative lon- 
gitudinal survey of the U.S. older population conducted every 2 years from 
1992-1993 to 2008 that includes several birth cohorts with different entry 
years. We extended the study by Yang and Lee (2010) to include the most 
recent wave of data in 2008. We included the maximum number of waves for 
each of the four birth cohorts for whom data on all health deficits and dis- 
orders for the construction of the Frailty Index (FI) are available: the Study 
of Assets and Health Dynamics Among the Oldest Old (AHEAD) cohort 
(born before 1924) surveyed in eight waves from 1993 to 2008; Children of 
Depression (CODA) cohort (born 1924-1930); HRS cohort (born 1931-1941); 
and War Baby (WB) cohort (born 1942-1947). All were surveyed in seven 
waves from 1996 to 2008. We did not include the youngest cohort—the Early 
Baby Boomers (EBB) cohort—because it entered the survey in 2004 and did 
not have sufficient data on age trajectories over time. Cohort membership is 
defined by the actual birth year. Age at first entrance into the samples ranges 
from 70+ for the AHEAD cohort, 66-82 for the CODA cohort, 55-75 for the 
HRS cohort, and 49-64 for the WB cohort. At each follow-up, respondents 
in each cohort aged together, yielding age trajectories of health deficits and 
disorders. The diagram in Figure 3.3 suggests that the observable age trajec- 
tories of different cohorts initiate and end at different ages, so comparison 
of different cohorts potentially can only be based on different segments of 
the cohort members' life course. As waves of data accumulate, however, age 
and cohort will become less and less confounded, making it increasingly 
possible to estimate cohort-specific age trajectories. The HRS data also use 
the accelerated longitudinal panel design but provide more waves of data, 
longer age trajectories, and hence higher power than the ACLs to test aging- 
related hypotheses within the cohort context. The fewer cohorts included, on 
the other hand, may limit the power of tests for cohort differences. 

The analytic samples for all cohorts exclude small numbers of respondents 
who had missing data on any covariates used in the regression analysis. The 
numbers of person-year observations pooling all waves for the AHEAD, 
CODA, HRS, and WB cohorts are 31,565, 17810, 48,485, and 16,565, respec- 
tively, which include observations contributed by respondents who were 
present at all waves, respondents who died at follow-ups (ranging from 5% 
for the WB cohort to 57% for the AHEAD cohort), and those who were non- 
respondents at follow-ups (14% for all). The HRS oversamples racial minori- 
ties such as Hispanics and blacks and provides weighting variables to make 
these representative of the community-based population. All statistical anal- 
yses adjusted for sampling weights (Table 3.12). 

Construction of the FI was based on the subset of health deficits most sim- 
ilar to those assessed in the original studies from the Canadian Study of 
Health and Aging (CSHA) and included symptoms, disabilities or impaired 
functions, disease classifications, and health attitudes (Mitnitski et al. 2002; 
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Mitnitski, Song, and Rockwood 2004). Specifically, we included 30 ques- 
tions available across waves: eight chronic illnesses respondents ever had 
(high blood pressure, diabetes, cancer, lung disease, heart problems, stroke, 
psychological problems, and arthritis); five disabilities in ADL (difficulty 
in walking across a room, dressing, bathing or showering, eating, getting 
in/out of bed); seven disabilities in IADL (difficulty in using a map, the toilet, 
or telephone; managing money; taking medications, shopping for groceries; 
and preparing hot meals); eight depressive symptoms as measured by the 
CES-D (felt depressed; everything an effort; sleep was restless; felt lonely, 
sad; could not get going; enjoyed life; was happy); self-rated health; and obe- 
sity (BMI 2 30). All but one variable are binary, with 1 indicating the presence 
and 0 indicating the absence of the deficit. Self-rated health was a 5-point 
ordinal scale and was mapped into the [0, 1] interval: 0 (excellent), 0.25 (very 
good), 0.5 (good), 0.75 (fair), and 1 (poor). Because six of seven IADL items 
and most of the frailty-associated items were not available for the calculation 
of the FI in the 1992 and 1994 surveys, respectively, we had to omit the data 
for those two waves for the HRS cohort. 

Following extant practice, we defined the FI as a count of deficits for any 
given person divided by the total number of possible deficits. With no miss- 
ing data, the denominator would be the theoretical maximum for all individ- 
uals (30 in this study). In the HRS data, although the majority of respondents 
had complete measures of all 30 items, the exclusion of respondents with 
missing answers would decrease the sample size substantially. We therefore 
included respondents who had data on at least 25 of the 30 conditions. For 
instance, if a respondent was administered 30 questions and responded pos- 
itively (there is a deficit) to 4 and negatively (no deficit) to 23 of them, then the 
FI for this person is 4/27. Both previous research and preliminary analysis 
showed no difference in the use of the full set of deficits and a reduced set. 
We report analysis from the latter sample with a larger number of observa- 
tions. The FI ranged between 0 and 09 and averaged between 0.14 and 0.19 
across cohorts. Other key variables and sample characteristics are also sum- 
marized in the table. The choice of operational definitions of these variables 
was based on a test of statistical significance for the regression coefficients 
and model fit. As mentioned, the analysis accounted for selection due to 
attrition by controlling for attrition status. 
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Formalities of the Age-Period-Cohort 
Analysis Conundrum and a Generalized 
Linear Mixed Models (GLMM) Framework 


4.1 Introduction 


The three commonly used research designs—tables of rates or proportions, 
repeated cross-sectional sample surveys, and accelerated longitudinal cohort 
designs—described in the previous chapter can be used for age-period- 
cohort (APC) analysis. The use of suitable data alone, however, does not guar- 
antee proper inferences regarding A, P, and C effects. Analytic approaches 
are often critical in determining the attribution of variations in the outcome 
of interest to distinct influences of age, period, and birth cohort. As noted 
previously and discussed in detail in this chapter, the APC underidentifica- 
tion problem, or the “APC conundrum,” has been a point of methodologi- 
cal controversy for decades, with little agreement on a systematic set of 
interrelated models and methods for analysis. Perhaps fueled by this lack 
of agreed-on methodological procedures for analysis, APC problems and 
controversies also have flared up from time to time in empirical analyses of 
substantive questions in the social sciences, some of which have been intro- 
duced in Chapter 3. 

This chapter first reviews the methods of descriptive and statistical analy- 
sis frequently used in previous APC studies. We discuss their utilities and 
limitations. We then lay out the formal algebra of the APC identification prob- 
lem and review conventional approaches to model identification. The many 
pitfalls in empirical APC analysis and lack of major breakthroughs in meth- 
odology led to the verdict by Norval Glenn (2005) that statistical APC models 
cannot be relied on to provide accurate estimates of A, P, and C effects. We 
reevaluate the state of the field and point out the gap in our knowledge. This 
is followed by a sketch of a generalized linear mixed model (GLMM) frame- 
work that unifies the APC models and methods described in subsequent 
chapters for the analysis of data from the aforementioned three research 
designs. This framework and associated analytic and computational tools 
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did not exist when the Mason and Fienberg (1985) synthesis of APC analysis 
in demographic and social research was articulated. We state the rationale 
for the GLMM approach to APC analysis in the concluding section of this 
chapter and provide detailed methodological guidelines on how to conduct 
APC analysis in the chapters that follow. 


4.2 Descriptive APC Analysis 


The vast majority of extant APC studies rely on two descriptive techniques 
to depict the time trends by age, period, and birth cohort. One common 
practice uses summary measures that are independent of age composition, 
such as standardized indices (such as age-standardized death rates) arrayed by 
time periods. This is exemplified in Figures 4.1 and 4.2, which show age- 
standardized lung cancer incidence and death rates, respectively, by sex and 
race calculated for the period between 1969 and 2008, adjusting the crude 
rates to the 2000 population age 20 and above. Increases in lung cancer inci- 
dence and death rates were evident for men only until the 1980s but continu- 
ous for women during the entire period of 40 years. 

An alternative device is a graphical display of the table of age-period-specific 
rates or age-cohort-specific rates. The former is illustrated in Figures 4.3 and 44, 
which show age-specific rates of lung cancer incidence and mortality for 
white males and females across time periods. Consistent with previous 
research, the age-specific rates generally increased steadily from the age 
of 20 to the 70s and then leveled off at older ages. Both males and females 
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FIGURE 4.1 
Age-standardized lung cancer incidence rates by sex and race, United States 1973-2008. 
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FIGURE 4.2 
Age-standardized lung cancer death rates by sex and race, United States 1969-2007. 
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Age-specific lung cancer incidence rates for white males and females by time period, United 
States 1973-2008. 


showed increases in incidence and death rates from lung cancer for the past 
40 years. Whereas the increases for males occurred later in life, were mod- 
erate and reversed in more recent years after 1995, female lung cancer inci- 
dence and death rates increased substantially and continuously from early 
adulthood through old age with no marked decrease. 

Figures 4.5 and 4.6 present the age-specific rates across birth cohorts. 
Changes in logarithmic transform of the rates can be interpreted as the pro- 
portional increase in rates. The results show leveling and declined incidence 
and mortality rates across the majority of male cohorts but sharply increas- 
ing rates across female cohorts born before 1940. 
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FIGURE 4.4 
Age-specific lung cancer death rates for white males and females by time period, United States 
1969-2007. 
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Age-specific lung cancer incidence rates (logarithmic scale) for white males and females by 
birth cohort. 
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Age-specific lung cancer death rates (logarithmic scale) for white males and females by 
birth cohort. 


Although comparisons of trends based on these descriptive measures 
almost always have implications for the A, P, and C effects that underlie the 
observed data (Hobcraft, Menken, and Preston 1982), neither approach explic- 
itly considers an APC modeling framework. Both are descriptive and share 
two disadvantages that are evident in the figures. First, they only describe 
variation in the rates attributable to factors associated with either period of 
death or birth cohort. Second, neither summarizes the corresponding rate 
table satisfactorily. Standardization ignores different trends at different ages 
and graphical methods represent all available rates providing, no summary 
at all (Osmond 1985). In addition, they each have their own limitations. 

Standardized crude rates by definition depend on the age composition 
of a particular period that is used as the standard and give more weight 
to the older ages (Preston, Heuveline, and Guillot 2001; Schoen 1970). The 
resulting rates can be sensitive to the choice of the standard, which changed 
drastically over the past 50 years (see, e.g., Preston, Heuveline, and Guillot 
2001: Table 2.2). Given the substantial population aging that has taken place 
in many nations in recent decades, including the United States, using the 


60 Age-Period-Cohort Analysis 


2000 population as the standard would yield higher rates than using a 
younger population in a much earlier year. As a result, inconsistent trends 
may occur from the use of different standard populations. 

An example of the possible inconsistencies of graphical summaries of rate 
data was given by Holford (1991) in his study of lung cancer incidence rates 
for women aged 20-84 living in Connecticut from 1940 to 1984. He showed 
that the age-specific rates plotted by time periods tended to increase and 
then reach a plateau or even decline in the oldest age groups, which contra- 
dicts the epidemiologic expectation that lung cancer risk would continue to 
increase with age. Age-specific rates plotted by birth cohorts, on the other 
hand, are consistent with such expectation in that the death rates showed 
a steady increase with age for all birth cohorts. In addition, graphs of rates 
from two-way age-by-period or age-by-cohort tables are helpful for qualitative 
impressions about temporal patterns, but they provide no quantitative assess- 
ment of the source of change (Kupper et al. 1985). For example, in Figure 4.7, 
the curve of age-specific lung cancer death rates for white females in any 
given time period, say 1995-99, cuts across a number of birth cohort curves, 
such as 1900, 1905, 1910, and 1920. Therefore, the shape of the period curve is 
affected by both varying age effects and cohort effects. The understanding of 
how these effects operate simultaneously to shift period curve requires the 
use of statistical regression modeling. 
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Age-specific lung cancer death rates for white females by time period and birth cohort. 
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4.3 Algebra of the APC Model Identification Problem 


The essence of APC analysis is the identification and quantification of dif- 
ferent sources of variation in an outcome of interest that are associated 
with age, period, or cohort effects. In addition to descriptive devices, early 
investigators also developed regression models for situations in which all 
three factors account for a substantive phenomenon and in which simpler 
two-factor models (such as an age-period model) are subject to model speci- 
fication errors and spurious results (Mason et al. 1973). The conventional 
linear regression model, also known as the APC accounting/multiple classifica- 
tion model, was introduced to sociologists by Mason and colleagues (1973) 
and serves as a general methodology for cohort analysis when all three of 
age, period, and cohort are potentially of interest. This general methodol- 
ogy focuses on the APC analysis of data using the first research design we 
described in Chapter 3, that is, tables of population rates or proportions, 
such as occurrence/exposure rates of events such as births, deaths, disease 
incidence, crimes, and so on, termed "archival data" by Mason et al. (1973). 
In spite of its theoretical merits and conceptual relevance, APC analysis of 
tabulated data suffers from the "identification problem" induced by the exact 
linear dependency between A, P, and C variables defined in this particular 
data structure. This can be viewed as a special case of collinear regressors 
that produces, in this case, a singular matrix (of one less than full rank) used 
in the statistical estimation process. Since a singular matrix produces mul- 
tiple estimators of the three effects, it is difficult to estimate the unique true 
separate effects. We now discuss this problem in greater algebraic detail. 

We use Table 3.1 of lung cancer incidence rates for U.S. white males as an 
example to study the algebra of the model identification problem. In this 
case, this model can be written in linear regression form as 


R;=l;/P;=u+e +B; + Ye + i (4.1) 


where R; denotes the observed incidence rate for the ith age group for i = 1, 
..., age groups at the jth time period for j = 1, ..., p time periods of observed 
data; I; denotes the number of incidences or newly diagnosed cases of lung 
cancer in the ijth group; P; denotes the size of the estimated population in 
the ijth group, the population at risk; u denotes the intercept or adjusted 
mean incidence rate; o; denotes the ith row age effect or the coefficient for 
the ith age group; D; denotes the jth column period effect or the coefficient 
for the jth time period; y, denotes the kth diagonal cohort effect or the coef- 
ficient for the kth cohort for k = 1, ..., (a + p - 1) cohorts, with k =a — i + j; and 
g; denotes a random error with expectation E(e;) = 0 and var(e;) = o2. 
Conventional APC models as represented in Equation (4.1) fall into the 
class of generalized linear models (GLIM or GLM) (see McCullagh and Nelder 
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1989 or McCulloch and Searle 2001 for expositions) that can take various 
alternative forms. First, model (4.1) can take a log-linear regression form via a 
log link as 


log(E;) = log(Dj) - u +0; * Dj + Ve (4.2) 


where E; denotes the expected number of incidences in cell (i, j) that is 
assumed to be distributed as a Poisson variate, and log(P;) is the log of the 
exposure P; in (4.1) and is called the "offset" or adjustment for the log-linear 
contingency table model. Models of this type are widely used in demogra- 
phy and epidemiology, for which the counts of demographic events such as 
deaths or the incidence of diseases generally follow Poisson distributions, 
and the rates are estimated through log-linear models (Agresti 1996). A 
second alternative formulation of the model, often used in studies of mortal- 
ity, is to treat the underlying number of events (deaths) as a binomial vari- 
ate. The canonical link changes from a log link to a logit link, which yields a 
logistic model: 


el ennt (4.3) 
J 
where 0; is the log odds of death, and m, is the probability of death in cell (i, j). 
This model has been implemented more widely in demographic research 
(e.g., Mason and Smith 1985). 

Regression models (4.1)- (4.3) can be treated as fixed effects generalized linear 
models (GLMs) after a reparametrization to center the parameters: 


Y oM B= neo (4.4) 


After reparameterization, model (4.1) can be written in the conventional 
matrix form of a least-squares regression: 


Y=xXb+e (4.5) 
where Y is a vector of incidence rates or log-transformed rates, X is the 
regression design matrix consisting of “dummy variable” column vectors 


for the vector [of dimension m = 1+ @-1) + (p - 1) + (a + p - 2)] of model 
parameters b: 


b = (u, 04, t OL, Bi, t B, Yi te Wie (4.6) 


with the T superscript denoting vector transposition, and where £ in (4.5) is 
a vector of random errors with mean 0. Note that the parameters o, B,, and 
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Yapı are not included in the parameter vector b because of the constraints 
(4.4) and can be uniquely determined by use of (4.4) in conjunction with each 
estimator of b. In Chapter 5, Section 5.6, we illustrate by an empirical example 
that the use of reference categories is equivalent to the translation by a con- 
stant of the parameter estimates produced by the constraints (4.4) and thus 
of no substantive importance. 

The ordinary least squares (OLS) estimator of the matrix regression 
model (4.5), which also is the maximum likelihood estimator (MLE) when 
the error term is specified as to be normally distributed with expected value 
of zero and constant variance, is the solution b of the normal equations: 


b-(QCXyuY (4.7) 


However, (XTX) does not exist. This is due to the fact that the design 
matrix X is singular with one less than full column rank (Kupper et al. 1985) 
as a result of the perfect linear relationship between the A, P and C variables 
defined in the data design shown in Figure 3.1: 


Period — Age = Cohort. 


Therefore, this OLS/MLE estimator does not exist De, there is no uniquely 
defined vector of coefficient estimates). This is the model identification prob- 
lem of APC analysis. It implies that there are an infinite number of possible 
solutions of the matrix equation (4.7) [i.e., estimators of model (4.5)], one for 
each possible linear combination of column vectors that results in a vector 
identical to one of the columns of X. Therefore, it is not possible to estimate 
the effects of A, P, and C separately without imposing at least one constraint 
on the coefficients in addition to the reparameterization (4.4). 


4.4 Conventional Approaches to the APC 
Identification Problem 


The Mason et al. (1973) article spawned a large body of methodological litera- 
ture in the social sciences, beginning with Glenrn's critique (1976) and Mason, 
Mason, and Winsborough's reply (1976), followed by Fienberg and Mason's 
work (1979). Similar debates occurred between Rodgers (1982) and Smith, 
Mason, and Fienberg (1982). These early investigations culminated in an edited 
volume on cohort analysis by Mason and Fienberg in 1985. Limitations of those 
analytic strategies propelled the search for other approaches. For example, a 
Bayesian approach developed by Nakamura (1982, 1986) and introduced to 
American social scientists by Saski and Suzuki (1987) was again critiqued by 
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Glenn, who cautioned against “mechanical solutions" to the problem (1989). 
The challenge of developing statistical models for APC analysis also stimu- 
lated considerable methodological work in biostatistics and epidemiology 
(e.g., Clayton and Schifflers 1987a,b; Holford 1991; Kupper et al. 1985). Several 
recent reviews provided useful additional material on these and related con- 
tributions to cohort analysis (Mason and Wolfinger 2002; Yang 2007, 2009). 

We discuss next several conventional solutions to the identification prob- 
lem, all of which are confined to conventional linear regression models and 
their GLM extensions. The purpose is often to break the linear dependency 
between the three APC dimensions and thus to identify the model, but each 
solution has limitations. 


4.4.1 Reduced Two-Factor Models 


One possible solution is just to estimate a reduced age and period two-factor 
model that contains no cohort effects and can be written as 


log(E;) = log(I;/ P;) +u +o; +B; (4.8) 


An example of this approach can be found in empirical investigations of 
sources of recent mortality reductions (Yang 20082). Studies are usually con- 
fined to changes in one or two demographic components. The cohort effect 
is less frequently tested, but its presence implies that certain assumptions 
currently used by demographers and other social scientists to analyze fac- 
tors contributing to mortality declines can be misleading. For instance, it 
is frequently assumed that rates of mortality declines over time are equal 
across birth cohorts. It is also assumed that these declines depend on rates of 
changes in period-specific conditions such as economic advance and health 
care technology that are independent of the year of birth. 

Because cohort effects can be interpreted as a special form of interaction 
effect between the categorical age and period variables, model (4.8) rests on 
the assumption of no interaction effect (Fienberg and Mason 1985). In partic- 
ular, the expected rate in age-by-period cell (i, j) is modeled a function of the 
marginal or gross effects of age i and period j only, and not also of the cell- 
specific effect, such as y,.;,;, which is a function of both i and j. Violation of 
this assumption can be detected by plots of age-specific rates by time period, 
and a lack of parallelism among these curves suggests that birth cohort effects 
are operating (Kupper et al. 1985). The same applies to the period effect as a 
particular type of age-cohort interaction (Holford 1991). Figures 4.3 and 44 
show evidence of nonparallelism among age curves by period in plots of 
lung cancer incidence and mortality, confirming the existence of birth cohort 
effects. Age curves of incidence and mortality rates by cohort also show 
nonparallelism. Therefore, it is likely that period effects are also important 
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in lung cancer trends. So, results of preliminary graphical analyses argue 
against the plausibility of reduced two-factor models. Regression analyses 
comparing model fit of reduced and full APC models further suggest that 
the latter is favored by taking all three variables into account (see Chapter 5). 
Therefore, the assumption of no cohort effects greatly simplifies estimations 
but can lead to model misspecification and is inconsistent with accumulating 
evidence of cohort changes in a variety of health outcomes and mortality. 


4.4.2 Constrained Generalized Linear Models (CGLIMs) 


Since the work of Fienberg and Mason (1979, 1985), the conventional and 
most widely used approach to estimating model (4.5) has been a coefficients- 
constraints approach, which takes the form of placing one or more additional 
identifying constraint (usually an equality constraint) on the parameter vec- 
tor (4.6) to just identify or overidentify the model. For example, in an APC 
analysis of U.S. tuberculosis mortality, Mason and Smith (1985) placed the 
constraint by equating the contrasts for ages 0-9 and 10-19. This yielded 
unique estimates for age, period, and cohort effects. Similarly, in the case of 
lung cancer incidence rates shown in Table 3.1, one can constrain the effect 
coefficients of the first two age groups (20-24 and 25-29) to be equal, o = oo 
or two periods (1973-74 and 1980-84) to be equal, B, = B,. With this one addi- 
tional constraint, the model (4.5) is just identified, the matrix (XTX) becomes 
nonsingular, and the least squares estimator (4.7) exists [as do related MLEs 
for models (4.1), (4.2), and (4.3)]. Overidentified constraints extend the just- 
identified constraints and in essence group multiple ages, periods, or cohorts 
into less-refined categories. A common CGLIM (constrained generalized 
linear model) approach is to create time periods or birth cohorts in longer 
intervals than the interval for age. The use of differential temporal grouping 
of the three variables breaks their exact linear dependency. 

The main problems with this CGLIM approach have been discussed in 
a large methodological literature in demography, epidemiology, and sta- 
tistics. First, the analyst needs to rely on external or side information to find 
constraints, but such information often does not exist or cannot easily be 
verified (Mason and Wolfinger 2002). Second, different choices of identify- 
ing constraints can produce widely different estimates of patterns of change 
across the A, P, and C categories. As demonstrated, for example, by Mason 
and Smith (1985) and Yang, Fu, and Land (2004), estimates of model effect 
coefficients are sensitive to the choice of the equality coefficient constraint. 
In the case of overidentification, the estimates may also be sensitive to the 
choice of interval widths that the analysts used to group variables because 
longer widths allow higher degrees of overidentification. There are addi- 
tional problems with differential interval grouping of A, P, and C variables 
that we discuss in Chapter 7. Third, all just-identified models will produce 
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the same levels of goodness of fit to the data, making model fit a useless cri- 
terion for selecting the best just-identified constrained model: 


4.4.3 Nonlinear Parametric Transformation 


The nonlinear parametric transformation approach defines a nonlinear 
parametric or algebraic function for at least one of the A, P, and C variables 
so that its relationship to others is nonlinear (Mason and Fienberg 1985). This 
requires reformulating model (4.1) in terms of continuous A, P, and C vari- 
ables, which allows for the specification of polynomials in one or more of 
these variables. For example, one can specify a quadratic or cubic function of 
age to model the nonlinear rate of change in verbal ability (Yang and Land 
2006) or happiness with age (Yang 2008b). The drawbacks of this approach 
are that (1) specifying a functional form of one or more of the APC dimen- 
sions makes the model less flexible, and (2) it may not be clear what nonlinear 
function should be defined. More important, the use of this approach alone 
does not completely solve the identification problem. As Mason and Fienberg 
(1985) pointed out, the linear terms of these variables are still unidentified. 


4.4.4 Proxy Variables 


The proxy variables approach uses one or more proxy variables to replace 
the A, P, or C variable. This is a popular approach because of its substantive 
appeal. After all, the indicator variables of A, P, and C serve as surrogates for 
different sets of unmeasured structural correlates (Hobcraft, Menken, and 
Preston 1982). Examples of proxy variables for cohort effects include relative 
cohort size (O'Brien, Stockard, and Isaacson 1999) and cohort mean years 
of smoking before age 40 (Preston and Wang 2006). The use of cohort char- 
acteristics to replace cohort effects in APC accounting models has recently 
been labeled the age-period-cohort characteristics (APCC) model by O'Brien, 
Stockard, and Isaacson (1999). Unemployment rates, labor force size, and 
gender role attitudes have been used as proxy variables for period effects 
(Pavalko, Gong, and Long 2007). 

But, there are caveats in the employment of this approach: (1) Proxy vari- 
ables should not be linearly related to cohort or period. Otherwise, they will 
be highly collinear with the other two effects, just as the effects they are 
intended to replace in the models (Glenn 2005). (2) As Mason and Smith (1985) 
clearly indicated, in the case of interactions that can affect the coefficients of 
the included variables, the use of measured variables will not provide any 


` Goodness-of-Fit and model selection statistics can be useful for ascertaining the extent 
to which one, two, or three of the age, period, and cohort temporal dimensions are neces- 
sary to model the patterns in a given age-by-time period array of rates or proportions; see 
Chapter 5, Section 5.5.2. 
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particular advantage. (9) Even in the absence of any interaction, the use of 
measured variables is not necessarily superior, as Smith, Mason, and Fienberg 
(1982: 792) cautioned, for replacing cohort or period with a measured variable 
“leaves open the question of whether all of the right measured variables have 
been included in an appropriately wrought specification. Although replacing 
an accounting dimension with measured variables solves an identification 
problem, it makes room for others." Thus, assuming that all of the varia- 
tion associated with the A, P, or C dimension is fully accounted for by the cho- 
sen proxy variables may be unwarranted. We test this assumption empirically 
in subsequent analyses using the GLMM approach by entering key cohort and 
period-level covariates previously hypothesized to proxy cohort and period 
effects in studies of mortality and health. For example, we compare estimates 
of cohort variance before and after the adjustment of the cohort mean years of 
smoking, which was used to replace the cohort effects in a recent study of U.S. 
adult mortality (Preston and Wang 2006). The lack of reduction or elimination 
of cohort variance then suggests that the proxy cohort effect variable chosen 
does not completely account for the cohort effect as expected and hence is not 
sufficient to represent the cohort effect. 

Winship and Harding (2008) recently proposed a mechanism-based 
approach that accommodates a more general set of models. Using the frame- 
work of causal modeling described by Pearl (2000) to achieve identification, 
the approach allows any given measured variable to be associated with 
more than one of the age, period, and cohort dimensions and provides statis- 
tical tests for the plausibility of alternative restrictions. If a rich set of mecha- 
nism variables is available and the original age, period, and cohort categories 
can be conceived as the exogenous elements of a causal chain (Smith 2008), this 
is an enriched and sophisticated alternative to the proxy variable approach. 


4.4.5 Other Approaches in Biostatistics 


In biostatistics and epidemiology, a number of solutions to the identification 
problems have also been proposed over the past two decades. There are three 
broad classes of solutions other than the CGLIM approach. The first and 
most widely adopted one uses estimable functions, deviations, curvatures, 
and drift and focuses solely on the nonlinear rather than linear components 
of trends (Clayton and Shifflers 1987a,b; Holford 1983; Tarone and Chu 1992). 
The second is based on the use of minimizing a penalty function to derive 
the necessary extra linear constraint (Osmond and Gardner 1989; Fu 1998, 
2000). The third includes the methods that use individual records of cases to 
construct a three-way APC table (Robertson and Boyle 1998). Reviews and 
comparisons of these methodologies are available in the work of Robertson, 
Gandini, and Boyle (1999) and Holford (2005). Generally, the statistical limi- 
tations of existing approaches have been acknowledged, and analysts have 
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been advised that any statistical modeling of APC data should be carried 
out in conjunction with a detailed descriptive analysis (Kupper et al. 1985; 
Mason and Smith 1985). Substantively, there also is a lack of clear and parsi- 
monious interpretation of the results in terms of the A, P, and C trends. And, 
similar to the conventional solutions in social sciences, they are confined to 
linear models (LMs) that necessarily incur the identification problem. 


4.5 Generalized Linear Mixed Models (GLMM) Framework 


Various analytic approaches mentioned have produced ambiguous and 
inconsistent results. Researchers do not agree on methodological solutions to 
these problems and conclude that APC analysis is still in its infancy (Kupper 
et al. 1985; Mason and Wolfinger 2002) Where does the early literature 
on APC analysis leave us today? If a researcher has a temporally ordered 
dataset and wants to tease out its age, period, and cohort components, how 
should the researcher proceed? Can any methodological guidelines be rec- 
ommended? A problem with much of the extant literature is that there is a 
lack of useful guidelines on how to conduct an APC analysis. Instead, one 
is left with the impression that either it is impossible to obtain meaningful 
estimates of the distinct contributions of age, cohort, and time period (Glenn 
2005) or that the conduct of an APC analysis is an esoteric art that is best left 
to a few skilled methodologists (Yang 2009). 

We seek to redress this situation by focusing on recent developments in 
APC analysis for three common research designs. The guiding principle of 
our work is a famous quotation from the celebrated statistician George E. P. 
Box (1979: 202), emeritus professor of statistics at the University of Wisconsin 
at Madison: "All models are wrong but some are useful." Our version of this 
statement is that all statistical models are wrong but, some models in APC 
analysis have better statistical properties and employ constraints that are 
more reasonable and substantively relevant than others, which may make 
them useful. We show that the models we present in this volume possess 
good statistical properties in comparison to alternatives. Nonetheless, as 
stated by Yang et al. (2008: 1733), every statistical model has its limits and 
will break down under some conditions. The models presented here are no 
exception, and we try to identify some conditions under which they become 
unstable and break down. 

We begin with the observation that the identification problem is not inevi- 
table in all settings—only in the case of conventional linear accounting mod- 
els that assume additive fixed effects of A, P, and C. In a special issue of 
Sociological Methods & Research (2008) that collected recent new developments 
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of APC models in sociology, Wenjiang Fu (2008) insightfully pointed out 
that the APC conundrum is “not data specific but model specific." Second, 
LMs also have conceptual limitations. Assuming additivity of A, P, and C 
effects, such models may be poor approximations of the processes generat- 
ing changes. Third, LMs in the form of an APC accounting model cannot 
include explanatory variables and test substantive hypotheses. These major 
limitations suggest the use of different families of models that are not subject 
to the identification problem and are perhaps suitable for addressing many 
new research questions. A useful alternative is a family of models that (1) do 
not assume fixed A, P, and C effects that are additive and therefore avoid the 
identification problem; (2) can statistically characterize contextual effects of 
historical time and cohort membership; and (3) can accommodate covari- 
ates to aid better conceptualization of specific social and biological processes 
generating observed patterns in the data. 

Over three decades ago, Nelder and Wedderburn (1972; see also McCullagh 
and Nelder 1989) introduced the class of GLMs as a generalization of classi- 
cal LMs. GLMs facilitate the regression analysis of data that are not normally 
distributed and the specification of nonlinear link functions that specify the 
expected value of the dependent or response variable as a nonlinear func- 
tion of parameters; the class of GLMs thus includes LMs as a special case. As 
noted by McCulloch and Searle (2001: 2), the last 30 years have seen GLMs 
extended further to the class of GLMMs, which permit the specification of 
some parameters as fixed and others as random. 

It is clear that the conventional models for APC analysis that we have dis- 
cussed thus far are LMs or GLMs that are members of the GLMM class. In 
the next chapter, we describe a recently developed method of estimation, the 
Intrinsic Estimator (IE), to identify APC accounting models of the form of 
(4.5) utilizing Moore-Penrose generalized inverse and an estimable function 
(Fu 2000; Yang, Fu, and Land 2004; Yang et al. 2008). The IE, then, is a pure 
fixed effects LM if the response variable is treated as normally distributed; if 
the response variable is an incidence or occurrence/exposure rate and there- 
fore more properly modeled by a log-linear or logit link, then the IE becomes 
a GLM. 

A mixed effects modeling framework has been developed for APC anal- 
ysis. Specifically, the class of hierarchical APC (HAPC) models initially was 
proposed (Yang and Land 2006, 2008; Yang 2006) to examine microdata in 
the second research design described in Chapter 3, repeated cross-sectional 
surveys, but they also can be readily applied to aggregate data in the first 
research design (O'Brien, Hudson, and Stockard 2008). The HAPC approach 
conceptualizes time periods and cohort memberships as social histori- 
cal contexts within which individuals are embedded and ordered by age 
and models them as random as opposed to fixed effects additive to that 
of age. This contextual approach broadens the theoretical foundation of 
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APC analysis, helps to deal with (actually completely avoids) the identifica- 
tion problem, and accounts for potentially correlated errors. As shown in 
Chapter 7, HAPC models are mixed effects models. While HAPC models 
applied to the General Social Survey (GSS) verbal test score data described in 
Chapter 3 use a linear link and Gaussian errors, we can extend such applica- 
tions within the GLMM framework to allow a flexible choice of nonlinear 
link functions and nonlinear errors, as are the cases for GSS happiness data, 
NHANES (National Health and Nutrition Examination Survey) obesity data, 
and SEER (Surveillance, Epidemiology, and End Results) cancer registry and 
mortality data of Chapter 3. In this way, HAPC models can accommodate a 
larger range of data used in social behavioral, life, and animal sciences. 

Hierarchical or mixed effects models are similarly useful tools for modeling 
accelerated longitudinal data in the third research design. Because repeated 
observations over time (level 1 units) can be viewed as nested within indi- 
viduals (level 2 units) from multiple cohorts followed over time, one can use 
the individual growth/change model, a specification of mixed effects mod- 
els, to assess simultaneously the intracohort age changes and intercohort dif- 
ferences. We show in Chapter 7 the estimation of individual growth curve 
models for cohort analysis of data on physical and mental health. 

In sum, it is clear that the class of GLMMs is an appropriate context within 
which to view the newly developed APC models and within which to further 
explore APC analysis. Using the GLMMs framework, we seek to develop an 
integrated statistical methodology for APC analysis that yields consistent 
models and substantive inferences across all three research designs. We 
make comparisons of inferences and conduct model validation/robustness 
analyses across different research designs and model specifications. There 
is, of course, no reason that APC inferences must be identical across all data 
and modeling formats. But, as statistical methodologists, it is incumbent on 
us to identify and explore the similarities, differences, and constraints that 
may apply to the more limited data designs (repeated cross sections, tabu- 
lated rates) as compared to a full accelerated longitudinal cohort design. This 
should yield some guidelines that may be useful in empirical studies. 

Indeed, since the APC underidentification problem is an instance of a 
larger family of structural underidentification problems, the potential range 
of application of the GLMM may be even larger. Structural underidentification 
problems occur when a conceptualization of effects of structural arrange- 
ments leads to an exact linear dependency among the effects. An example 
is the classical problem in mobility analysis of distinguishing the effects of 
socioeconomic mobility or distance moved on an outcome variable from the 
effects of origin and destination statuses (see, e.g, Duncan 1966). Another 
example pertains to the estimation of the effects of years of labor force expe- 
rience separately from current age and age at labor force entry. These and 
similar problems of structural underidentification occur frequently in sociol- 
ogy and related disciplines. The new models and methods may thus prove 
useful in applications to such problems. 
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5 


APC Accounting/Multiple Classification 
Model, Part I: Model Identification and 
Estimation Using the Intrinsic Estimator 


5.1 Introduction 


The age-period-cohort (APC) accounting /multiple classification model 
described by Mason et al. (1973) has served for over three decades as a gen- 
eral methodology for estimating A, P, and C effects in demographic and 
social research. It also has been the dominant model for APC analysis in bio- 
statistics and epidemiology (Holford 2005). This methodology uses conven- 
tional linear regression models or their generalized linear model extensions 
and has mainly been applied to data using the first research design (tables of 
rates or proportions) described in Chapter 3. The problem of identification of 
model parameters that arises from the application of APC accounting mod- 
els to tables of rates or proportions has long been recognized and deemed 
difficult to deal with, if not completely unsolvable, in the social science and 
biostatistics communities. 

This chapter focuses on an innovative approach to the model identifica- 
tion and estimation problem within the linear/generalized linear models 
framework for APC analysis. Incorporating recent methodological develop- 
ments on estimable functions in the context of the APC accounting model, a 
new method of estimation termed the Intrinsic Estimator (IE) was proposed 
and compared to the conventional coefficient constraints approach, that is, 
the constrained generalized linear model (CGLIM) (Fu 2000; Yang, Fu, and 
Land 2004). Subsequent methodological studies have further shown that the 
IE performs well compared to conventional approaches to the identification 
problem (Yang et al. 2008; Fu, Land, and Yang 2011). To date, the IE has been 
empirically applied in a number of studies in epidemiology, demography, 
and social science and has yielded sensible results on age, period, and cohort 
trends in a variety of topics, including both general and cause-specific mor- 
tality rates (Yang 2008); homicide arrest rates (Fu, Land, and Yang 2011); rates 
of religious activities and beliefs (Schwadel 2011); mortality due to accidental 
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poisoning (Miech, Koester, and Dorsey-Holliman 2011); trends in marijuana 
use (Miech and Koester 2012); and deer hunter demography (Winkler, Huck, 
and Warnke 2009). This chapter defines the IE, describes its statistical proper- 
ties, and illustrates the robustness of these properties via a comparison with 
age, period, and cohort estimates from an empirical application of a hierar- 
chical APC (HAPC) model (the subject of Chapters 7, 8, and 9) and from a 
numerical simulation analysis. Because an understanding of the numerical 
and statistical properties of the new estimator requires a substantial math- 
ematical background, readers who are not familiar with linear algebra or 
have no formal training in advanced statistical methods can choose to skip 
the latter part of the chapter and turn to the empirical applications of the 
APC accounting model described in Chapter 6. 


5.2 Algebraic, Geometric, and Verbal Definitions 
of the Intrinsic Estimator 


The objectives of this section are to formally define the IE, review proper- 
ties of the IE as a statistical estimator, report results of model validation 
assessments of the IE both from an empirical example and from a simula- 
tion exercise, give some "how-to-use" advice, and show how to relate the 
coefficients of the IE to those of conventional constrained APC models with 
applications to U.S. female cancer mortality rates, 1969-2007." Since the IE 
is a general-purpose method of APC analysis of potentially wide applica- 
bility in the social, life, and animal sciences, it merits recalling the criteria 
for acceptability of such a general-purpose method articulated by Norval 
Glenn, a long-time critic of attempts to provide general solutions to the APC 
analysis problem. Glenn (2005: 20) stated that such a method "may prove to 
be useful ... if it yields approximately correct estimates ^more often than not/ 
if researchers carefully assess the credibility of the estimates by using theory 
and side information, and if they keep their conclusions about the effects 
tentative." These are strong criteria, and we agree with them. The purpose of 
this and the next sections is to assess the extent to which the IE satisfies them. 

The consensus has been that the key problem for APC analysis in the con- 
ventional linear accounting/multiple classification model is to identify an 
estimable function that uniquely determines the parameter estimates. But, 
the controversy continues whether there exists such an estimable function 
that solves the identification problem. The conventional wisdom is that only 
the nonlinear, but not the linear, components of APC models are estimable 


` Parts of Section 5.2 and Appendices 5.1 and 5.2 are adapted from Yang, Y., W. J. Fu, and K. C. 
Land. 2004. Sociological Methodology 34:75-110 and Yang, Y., S. Schulhofer-Wohl, W. J. Fu, and 
K. C. Land. 2008. American Journal of Sociology 113:1697-1736. 
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(Holford 1985; Rodgers 1982a). As noted by Fu (2008), however, there have 
been only numeric demonstrations, but no rigorous proofs, to support the 
idea that no estimable function exists. It should also be noted that Kupper 
et al. (1985) provided a condition for estimable functions (see Section 5.2) and 
suggested that an estimable function satisfying this condition resolves the 
identification problem. Subsequent publications have shown that the IE sat- 
isfies this condition and estimates the unique estimable function, including 
both the linear and the nonlinear components of the parameter vector of the 
multiple classification model (Fu 2000; Fu and Hall 2006; Fu, Hall, and Rohan 
2004; Yang, Fu, and Land 2004). 

Within the context of the description of the algebra of the APC identifica- 
tion problem in conventional linear/generalized linear APC regression mod- 
els given in Chapter 4, we next describe the IE in three ways: algebraically, 
geometrically, and verbally. 


5.2.1 Algebraic Definition 


First, as concerns the algebraic definition of the IE, Yang, Fu, and Land (2004) 
showed that, because the design matrix X is one less than full-column rank, 
the parameter space of the unconstrained APC regression model (4.5) of 
Chapter 4 can be decomposed into the direct sum of two linear subspaces 
that are perpendicular to each other. One subspace corresponds to the unique 
zero eigenvalue of the matrix XTX of equation (4.7) and is of dimension one; it 
is termed the null subspace of the design matrix X. The nonnull subspace is the 
complement subspace orthogonal to the null space. 

Due to this orthogonal decomposition of the parameter space, each of the 
infinite number of solutions of the unconstrained APC accounting model 
(4.5) can be written as 


b=B+sBy (5.1) 


where s is a scalar corresponding to a specific solution, and B, is a unique 
eigenvector of Euclidean norm or length one. The eigenvector B, does not 
depend on the observed rates Y, only on the design matrix X, and thus is 
completely determined by the number of age groups and period groups— 
regardless of the event rates. In other words, B, has a specific form that is a 
function of the design matrix. To give an explicit representation of By, note 
that the exact linear dependency between A, P, and C variables in model (4.5) 
is mathematically equivalent to 


XB, -0 (5.2) 


This equation expresses the property that X is singular, that is, there exists a 
linear combination of the columns of the design matrix X that equals a zero 
vector. Kupper et al. (1985) showed that B, has the algebraic form 
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B G (5.3) 


that is, B, is the normalized vector of Bo: 


B, =(0,A,P,C)* (5.4) 
where 
al E ug D 2 
2 
Jet 1 PH T D) 
and 


with a, p, and c denoting the number of age categories, time periods, and 
cohorts, respectively, in the age-by-time period array of rates. 

It is important to note that the vector B, is fixed or nonrandom because it 
is a function solely of the dimension of the design matrix X or the number of 
age groups a and periods p and independent of the response variable Y. This 
can be illustrated with a specific numerical example. Suppose a = 3 and p = 3. 
Then from Equation (5.4) we have 


A-(1- 3, 2-331) 2 (1, 0) 


P-(32-1,32-2)-(Q, 0) 


C=(1- 38, 2-38, 3-38, 4-333) - (2, -1,0, 1) 


which yields 
By = (0, -1, 0, 1, 0, -2,-1,0, 1) 


We can then compute the vector B, as follows: 


By B, 


= (0, —0.354, 0, 0.354, 0, —0.707, —0.354, 0, 0.354)" 
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where (By Bo) 2 28'?, The fact that the trend estimates should be com- 
pletely determined by Y (and hence are intrinsic to Y) means that arbitrary 
terms associated with B, should be removed. But the conventional CGLIM 
approach violates this principle if the scalar s in Equation (5.1) is nonzero. 

The idea that B, should not affect the results is a key point, as intuition 
suggests that the eigenvector corresponding to the zero eigenvalue would 
be an arbitrary vector. And, indeed, sB, is arbitrary. But B, is not arbitrary; 
it is fixed by the design matrix. Furthermore, by Equation (5.1) any APC 
estimator, obtained by placing any identifying constraints on the design 
matrix, can be written as a linear combination B + sB,, where B is the special 
estimator termed the IE that lies in the nonnull parameter subspace and is 
determined by the Moore-Penrose generalized inverse.” 

One computational algorithm for the IE is based on an orthonormal trans- 
formation of a principal components regression (PCR) (for a standard exposition 
of PCR, see, e.g., Sen and Srivastava 1990) consisting of the following steps: 


(a) Compute the eigenvectors u,, ..., U„ of matrix XTX, where m denotes 
the number of rows (columns) of the XTX matrix be, m = 1 + (a — 1) 
+ (p - 1) + (a + p - 2). Normalize them to have unit length with ||u 
and denote the orthonormal matrix as U = (u,, ..., HA: 


nll 


(b) Identify the special eigenvector B, corresponding to eigenvalue 0. 
Denote u, = By without loss of generality; 


(c) Select the principal components to be the remaining eigenvectors 
Uy, ..., U,, with nonzero eigenvalues; 


(d) Estimate a PCR model with the outcome variable of interest 
(e.g., logged death rates) as the response using a design matrix V 
whose column vectors are the principal components i5, ..., Um that 
is, V = (uy, ..., Hal, to obtain the coefficients (wp, ..., w,); and 


(e) Set coefficient w, = 0 and transform the coefficients vector w = 
(Wi, .., w,)! by the orthonormal matrix of all eigenvectors U = 
(th, ..., Um) to transform the coefficients of the PCR model to obtain 
the Intrinsic Estimator B = Uw. 


Instead of using reference categories, the IE uses the “usual analysis-of- 
variance (ANOVA) type constraints” that the sums of the respective A, P, and 
C coefficients equal zero, termed effect coding. The computational algorithm 
used by the IE estimates the resulting effect coefficients for each of the a - 1, 
p - 1, and a + p -2 A, P, and C categories, respectively, which is consistent 
with the definition of the parameter vector in Equation (4.1) of Chapter 4. 
Then, the IE uses the zero-sum constraints to obtain the numerical values of 
the deleted A, P, and C categories. 


` See, for example, Searle (1971: 16—19) for a definition of the Moore-Penrose generalized inverse 
and its properties. 
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Two other perspectives on the IE are useful for interpreting and calculat- 
ing the IE. First, Fu (2000) showed that the IE can be viewed as a special 
case of the classical ridge estimator (Hoerl and Kennard 1970a, 1970b) for 
the conventional linear regression model that is used when the regressors 
are highly collinear. The ridge estimator shrinks the least squares estima- 
tor toward zero as a function of a shrinkage parameter A > 0, thus correcting 
the tendency of ordinary least squares (OLS) to produce estimated coefficients 
that are too far from zero when the regressors are highly correlated. This pro- 
duces a biased estimator that has a smaller mean-squared error than the least 
squares estimator, thus trading off bias for a reduction of error of estimation. 
Fu (2000) studied the ridge estimator in the singular design case, where the 
design matrix X has one less than full rank, of which the design matrix for 
the APC accounting model is an example, and showed (1) that the ridge esti- 
mator lies in a subparameter space orthogonal to the null space of the design 
matrix generated by the eigenvector of the zero eigenvalue, and (2) that the 
ridge estimator converges to the IE as the shrinkage parameter À tends to 
0. In other words, the IE can be interpreted as the limit of the ridge estima- 
tor as its shrinkage penalty goes to zero. Second, Tu, Kramer, and Lee (2012) 
studied the application of partial least squares (PLS) to the APC accounting 
model. Whereas PCR extracts the components independently of the outcome 
variable, PLS maximizes the covariance of the components with the out- 
come variable Y, extracting the components by order of this covariance from 
the highest to the lowest. Tu, Kramer, and Lee (2012) showed with an empirical 
application that, as the number of components extracted by PLS approaches 
the maximum number possible for a design matrix, the numerical values of 
the PLS estimates of the age, period, and cohort effect coefficients approach, 
and are within sampling error of, the corresponding coefficients estimated 
by the IE (which uses the maximum possible number of components). They 
also showed that an estimator based on the first three PLS components is 
a numerically reasonable approximation to the PLS effect coefficients esti- 
mated by using the maximum possible number of components. 


5.2.2 Geometric Representation 


The parameter space of the unconstrained vector b of model (4.5) of Chapter 4, 
P, can be decomposed into the direct sum of two linear subspaces that are 
orthogonal (independent) to each other: P = N @ 6, where one subspace 
(N) denotes the null space of X defined by the vector Bọ corresponding to 
the unique zero eigenvalue of the matrix XTX, and the other subspace (©) 
denotes the complement nonnull space orthogonal to N. The parameter vec- 
tor decomposition is 

b = b, + sBo (5.5) 


where b, = Bach is a special parameter vector that is a linear function of b 


corresponding to the projection of the unconstrained parameter vector b to 
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6, the nonnull space of X. Specifically, the special parameter vector b, cor- 
responding to s = 0 satisfies the geometric projection 


by = (I — B,BT)b (5.6) 


Note that B, represents a special direction in the parameter space defined 
by the difference between two arbitrary CGLIM estimators, b, — b;: 


Sib, —b,)= X(sBy) - 0 (5.7) 


where s is an arbitrary real number, and sB, represents arbitrary linear 
trends. Thus, the difference of any two arbitrary CGLIM estimators must 
be in the null space of X, that is, the space defined by Bọ. Different equality 
constraints of the CGLIM estimators assign different values to s. And, the 
arbitrary term in these parameter vectors, sB, leads to different estimates. 
This projection is illustrated in Figure 5.1, which shows the projection of 
two parameterizations, b, and b,, onto the nonnull parameter space (the ver- 
tical axis in Figure 5.1), which is independent of the real number s. The 


äere i x > 


0 Bo sBo 


FIGURE 5.1 

Geometric projection of parameters vectors. Note: b,, b,, and b, are three sets of parameter vec- 
tors that result from the imposition of various constraints on elements of the parameter vector 
b of the APC regression model; sB, represents the arbitrary term in these parameter vectors; 
projection of b,, b,, and b, to the vertical axis yields the same estimable function bh, The vertical 
vector b, is orthogonal to and therefore independent of the component sB,, which represents 
effects of the design matrix. While b, is estimable, none of the others, b, + sBy with s z 0 such as 
b,, by, or b, are. As shown later in the text, however, a statistical test can be defined and applied 
to ascertain whether or not the vector of estimated coefficients corresponding to constraints 
such as those used to define, say, b, are sufficiently close to those corresponding to by, as esti- 
mated by the IE, that it can be concluded that the scalar s = 0, and, correspondingly, that the 
equality constraint that produced b, also produces an estimable function in a statistical sense. 
(Adapted from Yang, et al. 2008: Figure 1.) 
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geometric representation in Figure 5.1 can be thought of as either a simple 
parameter space of dimension two or as multidimensional with the verti- 
cal axis representing a direction in a multidimensional nonnull space. In 
either case, since the projection of any parameterization of b yields the same 
parameter vector bọ the latter is estimable. Figure 5.1 provides only a geo- 
metric illustration. 

Figure 5.1 also helps to illustrate geometrically that the IE may, in fact, also 
be viewed as a constrained estimator. But, in contrast to the equality con- 
straints on two or more coefficients of the parameter vector b that are imposed 
in conventional approaches to the estimation of APC accounting models, 
the constraint imposed by the IE to identify model (4.5) is a constraint on the geo- 
metric orientation of the parameter vector b in parameter space. Specifically, the 
IE imposes the constraint that the direction in parameter space defined by 
the eigenvector B, in the null space of the design matrix X has zero influence 
on the parameter vector bọ that is, on the specific parameterization of the 
vector b that is estimated by the IE. Since B, is a fixed vector that is a function 
solely of the design matrix and does not depend on the observed event rates 
or frequencies being analyzed, this would seem to be a reasonable constraint. 

Corresponding to the projection of the parameter vector b onto b,, we have 
the following projection of the estimators of Equation (5.6) onto the IE: 


^ 


B - (I- BR (5.8) 


This equation provides another algorithm for computing the IE, namely, 
compute an initial estimator b of model (4.5), say by an equality constraint 
on two of the A, P, and C parameters, and then geometrically project b to the 
IE B by removing the component in the B, direction. Combining Equations 
(5.1) and (5.2) yields 


Xb = X(B + tBo)= XB+tXBy = XB+0=XB (5.9) 


This shows that the IE B is the estimator that uniquely estimates the A, D, and 
C effects in the constrained parameter vector b, = P,,,.;b. This point merits 
emphasis: The IE does not estimate the unconstrained coefficient vector b 
of the APC accounting/multiple classification model; rather, it estimates the 
projection of the unconstrained vector onto the nonnull space of the design 


matrix X. 


5.2.3 Verbal Description 


Statisticians have known since the work of Kupper et al. (1985) that the 
dimension of the design matrix in the APC accounting model (i.e., the num- 
ber of age groups and time periods) may affect the estimates obtained from 
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CGLIM estimators. Put in the simplest possible terms, the basic idea of the IE 
is to remove the influence of the design matrix on coefficient estimates. As 
noted in Section 5.3 and the appendices of this chapter, this approach pro- 
duces an estimator that has desirable statistical properties. 

The IE also can be viewed as a special form of PCR estimator, as shown 
in Section 5.2.1, that removes the influence of the null space of the design 
matrix X on the estimator. It specifically estimates a constrained parameter 
vector b, that is a linear function of the parameter vector b of the uncon- 
strained APC accounting model (4.5). This constrained parameter vector b, 
corresponds to the projection of the unconstrained parameter vector b onto 
the nonnull subspace of the design matrix X. 

Since the IE is a principal components estimator, one might well ask: Why 
not just calculate the eigenvectors of the matrix XTX by application of prin- 
cipal components, regress the observed rates on the subspace spanned by 
these eigenvectors, and leave it at that? The answer is that regression coeffi- 
cient estimates in this subspace are not directly interpretable in terms of A, P, 
and C effects. Therefore, the IE uses the extra step of inverse orthonormal 
transformation of the coefficient estimates of the PCR back to the original 
space of A, P, and C coordinates. The inverse transformation is what makes 
the IE a special form of principal components estimator. It yields coefficients, 
as illustrated by empirical applications in Section 5.4 and in Chapter 6, that 
are directly interpretable as A, D, and C effects and that can be compared 
to corresponding effects estimated by the conventional imposition of equal- 
ity constraints on parameters. 


5.2.4 Computational Tools 


Programs for estimating the IE can be written as add-on files to commercially 
available software packages. We provide codes for users of R and Stata in the 
online computational guides that accompany some analyses shown in the 
following material. An ado-file to calculate the IE in Stata may be obtained 
bytyping"ssc install apc"ontheStata command line on any computer 
connected to the Internet or by downloading from the Statistical Software 
Components archive at http://ideas.repec.org/s/boc/bocode.html. The pro- 
gram uses much the same syntax as Stata's glm command for generalized 
linear models. For example, a user whose dataset contains a dependent vari- 
able y, an exposure variable x, an age variable a, and a period variable t can 
fit a Poisson model with age, period, and cohort effects by typing 


apc ie y, exposure(x) family(poisson) link(log) age(a) period(t) 


Stata will then display statistics usually seen in applications of g1m com- 
mand. The program is documented more fully in a help file that can be read by 
typing “help apc ie" in Stata after installation. The package also includes 
a command "apc cglim" for calculating CGLIM estimators for readers 
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interested in methodological comparisons. This package, however, is not rec- 
ommended for use in empirical analysis due to its previously stated problems. 


5.3 Statistical Properties 


In general, it is desirable that estimators of coefficients in statistical models 
be unbiased in finite samples, relatively efficient in finite-time-period analy- 
ses in the sense of having a sample variance, or mean squared error (MSE) in 
the case of biased estimators, that is smaller than that of other estimators and 
consistent or asymptotically unbiased in large samples (Casella and Berger 
2001). We now examine the IE with respect to these properties. While readers 
are encouraged to read and understand all of these properties, those who are 
not familiar with linear algebra and asymptotics can skip the proofs in the 
appendices and Section 5.3.2. 


5.3.1 Estimability, Unbiasedness, and Relative Efficiency 


To study the statistical properties of the IE, consider first the role of esti- 
mable functions in APC accounting models. A large part of discussions of 
the parameter identification issue in APC models, which may be confus- 
ing and bewildering, pertains to what exactly it is for which an estimator is 
sought. The long methodological discussion of the APC identification and 
estimation problem in the 1970s and 1980s was centered on the presumption 
that the objective was to estimate the unidentified coefficient vector of the 
APC multiple classification model—the b vector in Equation (4.5)—and, as 
noted in Appendix 54, this search continues to motivate some methodologi- 
cal work today. But, the unconstrained b vector is not estimable in the sense 
of satisfying the estimability condition described in the following discus- 
sion. Accordingly, a concluding point of those methodological discussions 
was that only estimable functions of the model parameters should be recom- 
mended as they provide unbiased estimation of special characteristics of the 
model parameters (Kupper et al. 1983, 1985). Nonlinear estimable functions 
also have been applied to the APC accounting model (Clayton and Schifflers 
1987; Holford 1985, 1991; Robertson, Gandini, and Boyle 1999), although the 
classical definition of estimable functions is limited to only linear combina- 
tions of the model parameters that are independent of parameter constraint 
(Searle 1971: 180). 

For context, consider first the analysis of an APC dataset for a finite num- 
ber of time periods p. That is, suppose that an APC analysis is to be con- 
ducted for a fixed matrix of observed rates or event counts. This implies that 
the corresponding design matrix X is fixed Oe, X has a fixed number of age 
groups and time periods). The randomness in the error term g of model (4.5) 
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of Chapter 4 then corresponds to measurement errors in the rates or in the 
event counts or to intrinsic randomness in the rates or counts. In this context 
of an age-by-time period table of population rates with a fixed number of 
time period p of data, Yang, Fu, and Land (2004: 101) showed that the IE satis- 
fies a condition for estimability of linear functions of the parameter vector b 
that was established by Kupper et al. (1985: Appendix B) and recently fur- 
ther elaborated by Fu (2008). Estimable functions' are invariant with respect to 
whatever solution [see Equation (4.7) in Chapter 4] is obtained to the normal 
equations; estimable functions are desirable as statistical estimators because 
they are linear functions of the unidentified parameter vector that can be esti- 
mated without bias, that is, they have unbiased estimators.‘ Specifically, the 
condition for estimability of a constraint on the parameter vector of the APC 
accounting/multiple classification model that was established algebraically 
by Kupper et al. (1985) is, in the notation defined previously, that If B, = 0, 
where l! is a constraint vector (of appropriate dimension) that defines a linear 
function lb of b. In words, the Kupper et al. (1985) condition for estimabil- 
ity of a constraint on the parameter vector of the APC accounting model is 
that the constraint must be orthogonal or perpendicular to the null vector of 
the model. 

Applying this condition to the IE, note first that, since the IE imposes the 
constraint that s = 0, that is, that the arbitrary vector B, has zero influence, 
the constraint vector is IT = (I — B,B,). Since BIB, = 1, it follows that ITB, = 
(I - B,B5)Bo = By - BjB;B, = B, - B, = 0; that is, the Kupper et al. condition 
holds for the IE. Note also that the Kupper et al. condition implies that any 
constrained estimator that is obtained by imposing an equality constraint 
on the parameter vector b and that contains any nonzero component due 
to the vector B, defined by the design matrix is not estimable; that is, such 
a constrained estimator produces biased estimates of the A, P, and C effect 
coefficients in the projection of the unconstrained parameter vector b to ©, 
the nonnull space of X, that is, b, = Pb, the coefficient vector estimated 
by the IE. 

Because the IE B satisfies the Kupper et al. estimability condition for APC 
models, it follows from properties of estimable functions (Searle 1971: 181) 
that, for a fixed number of time periods of data, the IE B is an unbiased esti- 
mator of the special parameterization (or linear function) b, of b defined in 


` See Searle (1971: 180-188) or McCulloch and Searle (2001: 120-121) for expositions of this 
concept. 

* In the history of methodological discussions of the APC accounting model in sociology, 
Rodgers (1982a) was early to argue that analysts should seek estimable functions of the 
unidentified parameter vector; see also the comment by Smith, Mason, and Fienberg (1982) 
and the response by Rodgers (1982b). In some respects, the IE can be regarded as providing 
a practical, easily applicable method to produce estimates of estimable functions from data 
in the form of age-by-time period tables of rates, as called for by Rodgers over two decades 
ago. The estimability referred by Rodgers, however, essentially means identifiability that can 
be achieved by any linear constraints, which differs from statistical estimability defined for 
APC models by Kupper et al. (1985). 
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Equation (5.6): Thus, a first statistical property of the IE in the context of a linear 
model for APC analysis with a fixed number of time periods of data is that it produces 
unbiased estimates of the regression coefficients of the projected coefficient vector by. 

In addition, Yang, Fu, and Land (2004: 108) also showed that, for a fixed 
number of time periods of data, the IE is more statistically efficient (has a smaller 
variance) as an estimator of by than any CGLIM estimator that is obtained from a 
nontrivial equality constraint on the unconstrained regression coefficient estimator 
b, that is, any equality constraint that does not produce a projection of b onto by. The 
proof of this property is given in Appendix 5.2. In sum, the IE then has desir- 
able finite-time period properties. 

In brief, given an age-by-time period table of population rates or propor- 
tion for a finite number of time periods p of data—the classical context of 
the APC accounting/multiple classification model of demography, epide- 
miology, and the social sciences—the IE possesses the desirable statistical 
properties of unbiasedness and relative efficiency as an estimator of the b, = 

Ph constrained APC coefficient vector. When these statistical properties 
are taken together, and noting that the choice of the variance of estimators 
as a criterion for assessing their relative desirability is equivalent to choos- 
ing a quadratic norm in the underlying Euclidean vector space, the IE can be 
characterized as a minimum norm quadratic unbiased estimator (MINQUE)* This 
property is explained in Appendix 5.3. 


5.3.2 Asymptotic Properties 


The asymptotic properties of APC estimators, including the CGLIM and 
the IE, as the number of time periods p of data increase have been studied 
by Fu, Hall, and Rohan (2004), Fu and Hall (2006), and Yang, Fu, and Land 
(2004). These properties derive largely from the fact that the eigenvector 
B, converges elementwise to zero with increasing numbers of time peri- 
ods of data. The vector can converge elementwise to zero even though its 
length is fixed at one because the number of elements of the vector grows 
as we add time periods. Therefore, for any two estimators b, = B + s;By and 
b, = B + s,By, where s, and s; are nonzero and correspond to different iden- 
tifying constraints placed on model (4.5), as the number of time periods in 
an APC analysis increases, the difference between these two estimators b, 
and b, decreases toward zero, and in fact, the estimators converge toward 
the IE B. Suffice it to say that the proof proceeds by demonstrating that 
the coordinates of B, are bounded by a quantity that is a function of the 


` A direct proof of the unbiased property of the IE also is given in Appendix 5.1. 

* No other estimator of the coefficient vector of the APC accounting model can be a minimum 
quadratic norm estimator unless it is identical to that of the IE. This is due to the fact that the 
IE is estimated by the Moore-Penrose generalized inverse matrix, whose projection on 
the null space of the APC model is zero. This projection defines the minimum of the qua- 
dratic norm; see Appendix 5.3. 
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number of age groups and periods, and this function converges to 0 as 
p > œ (Yang, Fu, and Land 2004: Lemma 1). 

In addition, for the conventional linear regression specification of the APC 
accounting model [Equation (4.5) of Chapter 4] with a zero expected value 
and finite variance specification on the error term, Fu and Hall (2006) proved 
that (1) as the number of time periods of data increases, there are definite 
bounds on the absolute values of the differences between the effect coef- 
ficients estimated by the IE and the age effect coefficients of the projected 
coefficient vector by = P,,,,;b and on the expected values of the maximum dif- 
ferences between the estimated values of the period and cohort coefficients 
and their b, counterparts; (2) these bounds decrease with an increasing num- 
ber of time periods of data, and the convergence/rate of decrease is most 
rapid for the age effect coefficients. In other words, in many circumstances 
likely to apply to empirical data, the IE gives consistent estimators of the 
components of bo. 


5.3.3 Implications 


These statistical properties are not trivial and merit comment. Both the IE 
B and any other estimator b = B + sB, with s z 0 obtained from an equality 
constraint are asymptotically consistent as the number of time periods of 
data increases without bound. Therefore, with a large number (e.g., 30 or 40) 
of periods of data, differences among estimators decline, and it makes little 
difference which identifying constraint is employed. In most empirical APC 
analyses, however, there usually are a small number (e.g., 4 or 5) of time 
periods of observations available for analysis. In these cases, the differences 
can be substantial. 

As just noted, the IE, by its very definition and construction, satisfies the 
estimability condition of Kupper et al. (1985). Other estimators using equal- 
ity or other linear constraints on the parameter vector b derived from theory 
Or prior research or information on side conditions on a specific process 
being studied may satisfy this condition either exactly or statistically (in a 
sense defined further in the chapter). If other estimators do indeed satisfy 
the estimability condition, then they also produce unbiased estimates of the 
A, P, and C effect coefficients in the b, vector. If not, then the estimates they 
produce are biased. 

These properties provide a means for differentiating among estimators. 
That is, for tables of rates with a finite number of time periods of data, espe- 
cially a small number (e.g, 4 or 5), an unbiased estimator of b, should be 
preferred to a biased estimator as the latter can be misleading with respect 
to the estimated trends across the age, time period, and cohort categories. 

In contrast, as has been noted many times over the years in discussions of 
the APC accounting model (see, e.g., Pullum 1978, 1980; Rogers 1982; Smith, 
Mason, and Fienberg 1982), different just-identified models will generate 
the same data and yield exactly the same model fit. In particular, linear 
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transformations of the estimated A, P, and C coefficients obtained by the IE 
(i.e., linear transformations of the elements of the B vector) will fit the observed 
data just as well as the IE." If, however, such a linear transformation of the A, 
P, and C coefficients (or any subset thereof) results in coefficients that depart 
sufficiently far from the coefficients in B that the transformed coefficients 
contain a significant component of the B, vector De, contain a significantly 
nonzero s coefficient), then the resulting transformed vector may not be esti- 
mable in the sense of satisfying the Kupper et al. (1985) condition for estima- 
bility; that is, it may not be unbiased.* Thus, even though the transformed 
coefficients will reproduce the data just as well as those obtained by the IE, 
they will be biased and will give poor indications of the patterns of change 
across the A, P, and C categories used in the analysis. Therefore, goodness 
of fit to the data (as measured, e.g., by log-likelihood functions or deviance 
statistics) cannot be used as a criterion for selecting among estimators. But, 
estimability can be so used. 

Because of its estimability and unbiasedness properties, the IE may pro- 
vide a means of accumulating reliable estimates of the trends of coefficients 
across the A, P, and C categories of the APC accounting model. To provide 
intuition for this statement, recall the distinction between the steady-state 
and general solutions to the ordinary differential equation for, say, Hooke's 
law for the motion of a displaced spring-mass system subject to an additional 
forcing motion in classical mechanics. This law has the algebraic form F = 
—kx + acos(wt), where F denotes acceleration (second derivative of the motion 
with respect to time) of the mass, x denotes the distance of displacement, k is 
a constant unique to the particular spring under study, and acos(œt) is the 
forcing term (see, e.g., Marion and Thornton 1995: 125). If the mass is dis- 
placed by, say, a distance of 2 feet, it will oscillate back and forth with some 
influence of the length of the initial displacement, but it eventually will set- 
tle down to a characteristic pattern of oscillations that depends only on the 
driving force. By comparison, if the initial displacement of the mass is a dis- 
tance of 4 feet, then the mass will display an initial set of larger oscillations 
back and forth that are different from the pattern observed for the 2-foot 
displacement. But, after an initial series of oscillations, the driving force will 
cause the mass to settle into the same set of oscillations back and forth as 
those found after an initial 2-foot displacement. Mathematically, the pattern 
of oscillations observed after the impacts of the initial lengths of displace- 
ments have worn off are termed the steady-state solution of the Hooke's law 
differential equation, whereas the general solution of the equation consists of 
the steady-state solution plus a factor that takes into account the initial condi- 
tions or displacement of the spring. Because initial conditions can vary from 


` Geometrically, a linear transformation of the coefficient vector corresponds to a rotation of 
the vector in parameter space. Such a rotation will produce distorted and misleading indica- 
tions of patterns of change across the age, period, and cohort categories used in the analysis. 

* This point is illustrated in our empirical analyses and is the basis of the statistical test for 
estimability derived in Section 5.6. 
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application to application, the general solution of the differential equation 
can be quite unique to the application. On the other hand, the steady-state 
part of the solution is invariant and generalizable to the motion of the system 
regardless of the initial conditions. 

Analogously, the IE is essentially a steady-state solution to the APC 
accounting model estimation problem that factors out the initial conditions 
of the dimension of the matrix of observed data, namely, the number of age 
and time period categories that define the design matrix. Because the IE does 
not allow these "initial conditions" to influence the estimates it produces of 
the A, P, and C effect coefficients, they will be more invariant to changes in 
the design matrix, such as an additional time period of data, than estimates 
produced by estimators that incorporate such influences. In this sense, the IE 
removes the part of the subjectivity in the estimator that is due to the shape 
of the data. We illustrate this feature of the IE in our model validation analy- 
ses that follow. 


5.4 Model Validation: Empirical Example 


In brief, the IE possesses some valuable properties as a statistical estimator. 
But, given the long history of problems and pitfalls in proposed methods 
of APC analysis, it is reasonable to question whether this estimator gives 
numerical estimates of A, D, and C effect coefficients that are valid. This isa 
question of model validation—that is, does the identifying constraint imposed 
by the IE produce estimated coefficients that meaningfully capture the 
underlying age, time period, and cohort trends. 

One approach to the question of validity is to compare results from an APC 
analysis of empirical data by application of the IE with results from an analy- 
sis of the same empirical data by application of a different family of models 
that do not use the same or any identifying constraint. As an instance of 
such an empirical comparison, we next analyze the General Social Survey 
(GSS) verbal ability data described in Chapter 3. We apply the IE to data 
on verbal test scores grouped into five-year age groups and time periods 
shown in Table 5.1. In this age-by-period array, there are 12 five-year age 
categories from ages 20 to 75+, 6 five-year period groups from 1976 to 2006, 
and 12 + 6 — 1 = 17 ten-year birth cohorts born in 1901 to 1981. This yields 
72 degrees of freedom. The event/exposure rates (sample mean proportions) 
of correct answers to GSS vocabulary questions can be transformed by a log 


* We excluded ages 18-19 to obtain age groups of equal interval length so that the diagonal ele- 
ments of the age-by-period matrix refers to cohort members. The results are not influenced 
by omitting this cell of small sample size. Ages 75-89 are grouped into a last 75+ category to 
combine small population exposures for more stable estimates. 
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TABLE 5.1 
Verbal Test Correct Rates (Exposure): GSS 1976-2006 


Period 


Age 1976-1980 1981-1985 1986-1990 1991-1995 1996-2000 2001-2006 


20-24 0.55 0.53 0.53 0.55 0.56 0.57 
(8203) (8489) (5553) (4950) (5045) (1930) 
25-29 0.60 0.55 0.58 0.59 0.58 0.57 
(9671) (10653) (6946) (6480) (6780) (2730) 
30-34 0.63 0.63 0.59 0.61 0.60 0.60 
(8643) (8718) (7096) (7745) (7740) (2660) 
35-39 0.63 0.64 0.63 0.63 0.62 0.61 
(6776) (7946) (6799) (7960) (8400) (2780) 
40-44 0.62 0.63 0.64 0.65 0.63 0.62 
(5619) (5996) (5548) (6940) (8355) (2870) 
45-49 0.61 0.60 0.62 0.68 0.65 0.61 
(4595) (5112) (4734) (5915) (6775) (3150) 
50-54 0.62 0.60 0.59 0.62 0.66 0.65 
(5454) (4560) (3314) (4770) (5885) (2570) 
55-59 0.63 0.59 0.59 0.62 0.68 0.65 
(5439) (5335) (3207) (3590) (4225) (2400) 
60-64 0.63 0.60 0.56 0.61 0.60 0.66 
(4915) (4830) (3319) (3430) (3700) (1950) 
65-69 0.61 0.58 0.58 0.65 0.60 0.65 
(4010) (498) (3633) (3000) (2985) (1520) 
70-74 0.52 0.57 0.61 0.59 0.61 0.63 
(3420) (3333) (2789) (3305) (3540) (1290) 
75+ 0.55 0.53 0.54 0.57 0.56 0.63 
(4394) (4264) (3584) (4890) (4970) (1890) 


link and modeled by a log-linear regression for which the IE can be obtained. 
The events are the total number of correct answers for every age and period 
group and are calculated by multiplying the mean verbal scores by the num- 
ber of individuals. These are nonnegative counts and can be considered to 
be distributed as Poisson variates. The population exposure is calculated as 
the product of number of people in each cell and the total number of pos- 
sible correct answers (10). Vocabulary knowledge test scores are not available 
every year from 1974 to 2006. Therefore, for those missing years before 2000, 
we interpolated the mean verbal scores and numbers of individuals at risk 
based on the data of neighboring years. There are only two waves of data 
after 2000, so we just combined them in the calculation. 

Figure 5.2 shows the results from application of the IE to these data 
(accompanying codes available online). Estimated coefficients and their 
95% confidence intervals are plotted for successive categories within the 
age, period, and cohort classifications. Since they indicate changes in correct 
answer rates from one age group to the next, from one time period to the next, 
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Intrinsic Estimator coefficient estimates of the age, period, and cohort effects on vocabulary 
scores. 


and from one cohort to the next, the estimated coefficients represent the tem- 
poral trends of vocabulary knowledge along each of these three dimensions, 
net of the effects of the other two. The same data were analyzed by a series 
of studies by Yang and colleagues (Yang and Land 2006; Yang 2006; Frenk, 
Yang, and Land 2012) using a completely different approach to APC analy- 
sis—namely, a HAPC analysis in the form of a cross-classified random effects 
regression model. We more fully describe the HAPC model in Chapter 7. 
Briefly, this approach proceeds by building a level 1 fixed effects regression 
model at the individual level of analysis and then a random effects model for 
cohort and time period effects at level 2. The mixed effects model does not 
rest on the assumption of additive and fixed A, D, and C effects used by the 
conventional linear regression models that cause the identification problem. 
The resulting estimates of cohort and period effect coefficients are average 
residual effects of the cohorts and period across all time periods and cohorts, 
respectively, and are not constrained in any way to conform to the IE or any 
other identifying constraint required by the accounting model. The results of 
the HAPC analysis are shown in Figure 5.3, which is reproduced from Frenk, 
Yang, and Land (2012) and Table 72 in Chapter 7. 

While the coefficient metrics and timescales for the age and period effects 
in Figure 5.3 are in single years rather than the 5-year groups of those from 
the IE analysis in Figure 5.2, the trends of estimated effect coefficients are 
quite similar across the graphs. That is, both exhibit age effect curves that 
are quadratic and concave, corroborating those found by Wilson and Gove 
(1999): low at youth, rising to a peak in the 40s, staying largely flat until the 
mid-50s, and declining gradually into late life. Both produced period effect 
curves that showed slight declines from the 1970s into the 1980s followed by 


92 Age-Period-Cohort Analysis 


Age Effect Period Effect 


9v 
w 
1 


Verbal Test Score 
m o 
o = 
) 
Verbal Test Score 


L l | LL LL p L 1 1 1 1 1 sik = == P= [pep 
18 23 28 33 38 43 48 53 58 63 68 73 78 83 88 1974 1978 1982 1986 1990 1994 1998 2002 2006 


Age Year 


Cohort Effect 


Verbal Test Score 
E 
a Uu 
| 


5 l l l l l l l l l l l l l l l l l l J 
—1894 1895 1900 1905 1910 1915 1920 1925 1930 1935 1940 1945 1950 1955 1960 1965 1970 1975 1980 1985 


Cohort 


FIGURE 5.3 
HAPC model estimates of age, period, and cohort effects on verbal scores. 


slight rises into the 1990s that flattened out until 2006. Variation in vocabu- 
lary knowledge over time for the past 30 years, therefore, is quite revealing 
given the absence of direct estimates of period effects in the previous studies. 
And, the cohort effect curves are also quite similar—showing two peaks for 
cohorts born in the early and middle twentieth century followed by declines 
in recent cohorts. These results give mixed support for the hypothesis of 
intercohort declines in previous studies (Alwin and McCammon 1999; Glenn 
1999) as there are also increases in vocabulary knowledge for cohorts born 
before World War I and between 1930 and 1950. 

In sum, this comparison shows that the independent estimates from a 
mixed effects models analysis corroborate the estimated patterns of change 
across age, period, and cohort categories that are obtained by the IE analysis. 
Of course, this is only one example of comparative analysis, and additional 
empirical studies are needed before it can be concluded that the IE pro- 
duces substantively meaningful and empirically valid results under vari- 
ous circumstances. Additional model validation assessments of the intrinsic 
estimation through empirical analyses of various datasets are reported in 
Section 5.6 and in Chapter 8, Section 8.3. 


5.5 Model Validation: Monte Carlo Simulation Analyses 


Given the long history of debates over the existence of any solution to the 
APC model identification problem (Glenn 2005), questions can be raised 
regarding whether the IE method is based on a constraint of purely algebraic 
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convenience and just as arbitrary as previous methods of constraints. Is the 
IE just another way to go wrong? The conceptual foundations of the IE have 
been established statistically using mathematical proofs and thus remain 
abstract and potentially difficult to understand. Further exposition and anal- 
ysis, especially using straightforward and replicable numerical illustrations, 
is needed to directly address this question. 

This section reports the numerical results from Monte Carlo (MC) simula- 
tions that are conducted systematically for the IE and CGLIM estimators of 
APC multiple classification models based on the work of Yang, Schuholfer- 
Wohl, and Land (2007). We compare results from application of an estimator 
such as the IE or CGLIM to artificial data wherein we know the form of the 
underlying model that generated the data.’ The simulation analyses can help 
to determine conditions under which the IE indeed recovers the parameters 
of the underlying model while CGLIM estimators do not. Then, we revisit 
a recent critique of the utility of APC models in social research through 
numerical examples (Glenn 2005), show the consequence of misuse and mis- 
interpretation of such models, and make suggestions to future research in 
light of findings from the simulation analysis. 

We first investigate whether the IE is unbiased and is relatively efficient in 
samples with a fixed number of age groups and time periods. The asymp- 
totic results for the IE summarized in Section 5.3.2 apply as the number of 
periods in the dataset goes to infinity. But, any dataset used in practice has 
only a finite number of periods. Thus, we explore whether the asymptotic 
results give good approximations to the behavior of the IE in finite samples 
by simulating datasets with 5, 10, and 50 periods. Based on the mathemati- 
cal analyses of Fu and Hall (2006) summarized in Section 5.32, the basic 
asymptotic result we investigate in these numerical simulations is that, as 
the number of periods increases, estimated age effects should converge 
to the underlying age effects when using the IE but not necessarily when 
using other estimators. If the underlying processes generating the period- 
to-period changes in the observed outcomes are constant throughout the 
periods of the simulation, then we also expect the estimated period and 
cohort effects converge to the underlying period and cohort effects when 
using the IE* We conduct the simulation analyses systematically to examine 
the performance of the IE and alternative methods, CGLIM in particular, 


` [n the methodological literature on estimation of the APC accounting model, a distinction 
often is made between the "true (but unknown) population parameter values" (Kupper et al. 
1985: 822) or "the population parameters that generated the data ... the 'true' solution (the one 
representing the process that generated the outcomes)" (O'Brien 2011a: 436). In the exposition 
that follows, we use the term underlying model for this referent. 

If such processes are not constant and change from period to period, regardless of the estima- 
tor used, estimated period and cohort effects cannot be expected to converge to their under- 
lying values as the number of periods increases because adding a period to the dataset does 
not add information about the previous periods or about cohorts not present in the period 
just added. 


T 
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in reproducing the underlying models when the underlying model is a full 
APC model in which all three of the A, P, and C effects are present. 


5.5.1 Results for APC Models: True Effects of A, P, and C All Present 


We fix the number of age categories in all simulations given the fact that 
humans have a relatively fixed life span. We let the number of age categories 
be 10 without loss of generality. For a given number of periods P, we generate 
1,000 datasets by MC simulation in which the entries in the 10 x P outcome 
matrix are distributed according to 


Vj ~ N(u,o°) 
where’ 
u =0.3+0.1(age;; — 5.5)2 +0.1 sin(period;;) + 0.1 cos(cohort;;) + 0.1sin(10 - cohort;;) 
0: 295 


This equation for the data-generating process tells us what the underlying 
age, period, and cohort effects are: 


Age effect at age a 0.1(a — 5.5)? 
Period effect in period p — 0.1 sin(p) 
Cohort effect in cohorte 0.1 cos(c) + 0.1 sin(10c) 


So that the underlying effects have mean zero in each category in accord 
with the constraints on the effect coefficient specified previously, we sub- 
tract constants from the effects listed, where the constants are calculated 
as the mean effects for each temporal category. To explore the finite time 
period properties of various estimators, we then estimate A, P, and C effects 
in each simulated dataset for a given P using the IE and using three dif- 
ferent CGLIM estimators: one with the first two age effects constrained to 
be equal (CGLIM al, one with the first two period effects constrained to be 
equal (CGLIM. p), and one with the first two cohort effects constrained to 
be equal (CGLIM. o To explore the large-sample properties of these estima- 
tors, we let P increase from 5 to 10 and to 50 and repeated the simulations for 
each number of P 

Table 5.2 reports the results for A, P, and C effects estimated from data sim- 
ulated with five time periods (P = 5). For each effect in the model, we show 
the underlying value and, for each estimator, the mean, standard deviation, 


* We chose the variance of 25 so that the sampling variability of the estimator would be visible 
in our graphs. Experiments with smaller and larger variances produced qualitatively simi- 
lar results. 
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FIGURE 5.4 
Means of estimates from 1,000 simulations of APC models with P = 5: IE versus CGLIM. 


and MSE of the estimated effect across 1,000 simulations. By comparing the 
mean of the simulated estimates to the underlying values, we can assess 
the degree of unbiasedness for each estimator. The standard deviation of the 
simulated estimates shows how much the estimated parameters vary from 
sample to sample. Smaller variance relates to relative efficiency. MSE is the 
average squared difference between the estimated parameter and the truth; 
this measure of accuracy takes into account both bias and variance. 

Figure 54 compares the means of IE and CGLIM estimates shown in 
Table 5.2. Two of the four estimators recover the profile of the A, D, and C 
effects qualitatively: the IE and the CGLIM. p. The other two sets of CGLIM 
estimates that constrain the first two age and first two cohort effects to be 
equal clearly fail to recover underlying forms of these effects because the 
constraints are incorrect: The first two underlying age effects and the first 
two underlying cohort effects are not equal, and the differences between the 
two underlying effects are large. The CGLIM, p estimator recovers the qual- 
itative shapes of underlying effects more closely because constraining the 
coefficients of the first two period effects to be equal more closely approxi- 
mates the fact that the difference between the two underlying effects is much 
smaller. Scrutiny of the numerical results in Table 5.2, however, suggests that 
CGLIM. p estimates are far off the mark in quantitative terms; only for the 
IE is the mean of each estimated effect close to the underlying value and 
hence unbiased. This is a direct result of the nonestimability of the CGLIM 
estimator. That is, any substantial departure from the IE constraint, which 
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incorporates a large nonzero s, will not yield an estimable function and thus 
will induce bias in the estimates. 


5.5.1.1 Property of Estimable Constraints 


This constrained model illustrates numerically a property established algebra- 
ically by Kupper etal. (1985: 830), namely, that, if the constraint used to just iden- 
tify the APC accounting model is, indeed, satisfied by the underlying model, 
then the Kupper et al. orthogonality condition stated previously (/7By = 0) will 
hold, and the corresponding constrained coefficient vector is estimable. 

With respect to the specification of the underlying effect coefficients in the 
present simulation, for example, it can be noted that, due to the periodicity of 
the age coefficients, several are equal. Thus, for example, if the analyst were 
to impose the identifying constraint al = a10, then the corresponding con- 
strained coefficient vector will be estimable, and in fact, the resulting CGLIM 
estimated coefficients will be within sampling and rounding error of those 
estimated by the IE. We make this point more precise through a statistical 
test described in Section 5.6. 

Table 5.2 also shows that the IE exhibits substantially less sampling varia- 
tion than the CGLIM estimators. The IE estimates of A, P, and C effects have 
standard deviations that range between 0 and 1. CGLIM, p estimates have the 
smallest standard deviations among all CGLIM estimates, but their stan- 
dard deviations are still at least 10 times larger than those of the IE. The IE 
also has much smaller MSEs. The MSEs of the IE estimates are close to 0, 
whereas those of the CGLIM, p estimators can be as large as 6. All estimators 
have larger MSEs for the youngest and oldest cohorts because these cohorts 
are located at the upper and lower corners of the age-by-period table and 
have the smallest sample sizes. The CGLIM_a and CGLIM c estimates have 
MSEs too large to provide reliable findings. It is noteworthy that the cohort 
effects are particularly poorly estimated by the CGLIM models. 

To see what happens to these estimators in cases when analysts have access 
to more data, we next increased the number of time periods to 10 and 50. 
Because the CGLIM p continued to yield the best estimates among all CGLIM 
estimators, the following analysis focuses on the comparison of the CGLIM, p 
and the IE and is summarized in Table 5.3. 

The means and standard errors of age effects estimates are shown in 
Figure 5.5 for the IE and the CGLIM_p by number of periods P. The means of 
the IE are extremely close to the underlying age effects for all P and rapidly 
approach the underlying age effects as P increases from 5 to 50. The means 
of the CGLIM p also recover the underlying age effects well and do better 
with increasing P, but to a less extent than the IE. Although the means of the 
IE and CGLIM p are close, the CGLIM_p shows much larger standard errors 
and thus a statistically significant difference between the mean and the truth. 
Comparison of the standard errors across P (not shown) suggests decreasing 
sampling variations for both estimators with increasing P but much smaller 
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Mean squared errors of the age effects estimates from 1,000 simulations of APC models by P: 
IE versus CGLIM. 


variability for the IE for all P. Figure 5.6 further shows the advantage of the 
IE in terms of MSE. The IE has MSE much closer to 0 than the CGLIM_p 
for all P. Whereas the MSE of the IE approaches 0 as P increases, that of the 
CGLIM. p, although decreasing, is far above 0. 

Figures 5.7 and 5.8 present the mean and MSE of the IE and CGLIM. p esti- 
mates of period effects and cohort effects, respectively. Similar to the results 
shown previously, the IE recovers the underlying effects much better for all 
P, increases in precision, and decreases in MSE with increasing P. In contrast, 
the CGLIM. p shows much larger departures from the underlying effects that 
do not decrease with increasing P. The first two time period coefficients were 
constrained to be equal by the CGLIM. p, whereas in fact they increase slightly 
from time 1 to time 2. Asa result, the period effects estimated by the CGLIM p 
rotate the underlying period effects (horizontal oscillations) upward. And, the 
cohort effects estimates are rotated downward. While the IE has a MSE close 
to 0, the CGLIM p also produces much larger and in some cases increasing 
MSE with increasing P. This illustrates that linear constraints with even small 
deviations from the truth can result in coefficient estimates with large bias in 
unknown directions that will not lessen with more periods of data. 

Several insights follow from this analysis. First, the IE produces estimates 
of the A, P, and C effects that are more invariant to changes in the design 
matrix, such as additional time periods of data, than estimates produced 
by estimators that incorporate functions of the design matrix (such as the 
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Period effects estimates from 1,000 simulations of APC models by P: IE and CGLIM. 
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CGLIM). This precisely is because of its estimability and unbiasedness prop- 
erty. In this sense, the IE reduces the part of the subjectivity in the estimator 
that is due to the influence of fixed component determined by the shape of 
the data by removing it. i 

Second, both the IE B and any other estimator b = B + sB, with s z 0 obtained 
from an equality constraint produce asymptotically consistent age effects 
as the number of time periods of data increase without bound. Therefore, 
with a large number (e.g., 50) of periods of data, differences among estima- 
tors decline, and it makes little difference which identifying constraint is 
employed. In most empirical APC analyses, however, there usually are a 
small number (e.g., 5) time periods of observations available for analysis. In 
these cases, the differences can be substantial, and an unbiased estimator 
should be preferred to a biased estimator as the latter can be misleading with 
respect to the estimated trends. 

Third, as a result of the properties mentioned, the IE may provide a means 
for the accumulation of reliable estimates of the A, P, and C trends when 
more data become available over time, whereas the other estimators may not. 


5.5.2 Misuse of APC Models: Revisiting a Numerical Example 


The previous exposition and numerical simulation analyses suggested that 
(1) the IE indeed yields unbiased and relatively efficient estimates of A, P, and 
C effects in the IE constrained coefficient vector b, = P;,;b; (2) the estimates of 
the elements of b, will be relatively close to the estimate of any equality con- 
strained coefficient vector that satisfies the estimability condition of Kupper 
et al. (1985); and (3) the elements of b, converge to the underlying effects with 
an increasing number of time periods. This seems to be in conflict with the 
age-old notion that there is no solution to the APC accounting model identifi- 
cation problem because there can be any number of estimates that fit the data 
equally well. This notion is best represented in a recent critique of the utility 
of APC models in social research raised by Norval Glenn (2005). We next 
revisit the numerical example given by Glenn to evaluate this critique. 
Glenn based his analysis on some hypothetical datasets that are cited here 
as Table 5.4 for purpose of illustration. These data potentially show very dif- 
ferent A, P, and C effects. The values of the dependent variable in Table 5.4a 
show obvious age variations with an increment of 5 for each successive age, 
but seemingly no period or cohort effects. Glenn correctly pointed out that 
there could be some combination of A and offsetting P and C effects, and an 
infinite number of combinations of such effects can produce the pattern of 
variation. Data in Tables 5.4b and 5.4c show stronger period and cohort vari- 
ations, respectively, and similarly can arise from many different combina- 
tions of underlying effects. In subsequent analyses, Glenn used the CGLIM 
approach to estimate APC models of these data. We present these results of 
the CGLIM analyses for data of Table 5.4a in Table 5.5. Corresponding analy- 
ses were conducted for the other two datasets but are not reported because 
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TABLE 5.4 


Patterns of Data Showing Age, Period, and Cohort 
Effects (Glenn 2005) 


Table 5.4a “Pure Age Effects” (Glenn 2005: Table 1.2) 


Age 


Year 


1950 1960 1970 1980 1990 2000 


20-29 
30-39 
40-49 
50-59 
60-69 
70-79 


50 50 50 50 50 50 
55 55 55 55 55 55 
60 60 60 60 60 60 
65 65 65 65 65 65 
70 70 70 70 70 70 
75 75 75 75 75 75 


Table 5.4b "Pure Period Effects" (Glenn 2005: Table 1.3) 


Age 


Year 


1950 1960 1970 1980 1990 2000 


20-29 
30-39 
40-49 
50-59 
60-69 
70-79 


30 35 40 45 50 55 
30 35 40 45 50 55 
30 35 40 45 50 55 
30 35 40 45 50 55 
30 35 40 45 50 55 
30 35 40 45 50 55 


Table 5.4c "Pure Cohort Effects" (Glenn 2005: Table 1.4) 


Age 


Year 


1950 1960 1970 1980 1990 2000 


20-29 
30-39 
40-49 
50-59 
60-69 
70-79 


50 55 60 65 70 75 
45 50 55 60 65 70 
40 45 50 55 60 65 
35 40 45 50 55 60 
30 35 40 45 50 55 
25 30 35 40 45 50 


Source: Norval Glenn (2005). Cohort Analysis, 2nd ed. 


Note: 


Thousand Oaks, CA: Sage Publications. 
As noted by Glenn, data in Table 5.4a could show 
alternative true effects such as pure A effects, offset- 
ting P and C effects, or a combination of A effects 
and offsetting P and C effects. The same applies to 
data in Tables 5.4b and 5.4c. 
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TABLE 5.5 


Regression Coefficients of Linear APC Models of 
Data in Table 5.4 Estimated by the CGLIM and IE 


CGLIM IE 


Variable 1 2 3 4 5 


Intercept 62.5 62.5 62.5 62.5 62.5 


Age 

20-29 -12.5 -12.5 0 -5 -112 
30-39 -7.5 -7.5 0 -5 —6.7 
40-49 -2.5 -2.5 0 -1.4 -2.2 
50-59 2.5 2.5 0 1.1 22 
60-69 7.5 7.5 0 3.6 6.7 
70-79 12.5 12.5 0 6.4 11.2 
Period 

1950 0 0 -12.5 -5 -1.3 
1960 0 0 -7.5 -5 -0.8 
1970 0 0 -2.5 -1.4 -0.3 
1980 0 0 2.5 1.1 0.3 
1990 0 0 7.5 3.6 0.8 
2000 0 0 12.5 6.4 1.3 
Birth Cohort 

1880 0 0 25 11.2 2.6 
1890 0 0 20 10.1 2.1. 
1900 0 0 15 74 1.6 
1910 0 0 10 53 1.1 
1920 0 0 5 3 0.5 
1930 0 0 0 0.2 [U 
1940 0 0 -5 -2 -0.5 
1950 0 0 -10 —.7 -1 
1960 0 0 -15 -7.3 -1.6 
1970 0 0 -20 -9.9 -2.1 
1980 0 0 -25 -13.8 -2.6 


Source: Data Modified from Norval Glenn, Cohort 
Analysis, 2nd ed (2005: Table 2.1) Thousand 
Oaks, CA: Sage Publications; the regression 
coefficients are centered to sum to zero within 
age, period, and cohort categories based on the 
usual constraint, Zo, = > B; = £y, = 0; coefficients 
highlighted in bold are constrained to be equal. 
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the results are similar. Models 1 to 4 using Glenn’s results confirm the point 
made previously that different equality constraints result in drastically dif- 
ferent estimates of A, P, and C effects. For purpose of comparison, we add 
model 5 estimated by the IE. There are two fallacies in Glenn's interpretation 
of results of models 1 to 4. 

First, patterns in hypothetical data do not suggest underlying effects gen- 
erating them. Glenn acknowledged that the data in Table 5.4a can be gener- 
ated by any forms of the underlying A, P, and C effects, but then contradicted 
himself in the discussion of the modeling results, claiming that "For this 
simulation experiment, I know what the effects are and can apply the Mason 
et al. method to the data to see how well it performs" (p. 12). Specifically, he 
assigned the underlying effects to be the age effects shown in models 1 and 2 
(although this guess is not wrong, as will be shown in the following by model 
selection analysis). Based on this logically incorrect starting point of what 
the underlying effects are, he went on to conclude that the APC accounting 
model gives “grossly incorrect results" using certain constraints (like those 
in models 3 and 4) and hence it is impossible to estimate APC effects with 
this model in practice when one cannot know what the right constraints are. 
It is clear from our simulation analysis shown previously that one can only 
examine the performance of model estimators by specifying the underlying 
effects that generate the data rather than using certain data to speculate what 
the underlying effects are. 

Second, the assumption that the age, period, and cohort trends in any given 
set of data can be best estimated by full APC models rather than by reduced 
models of one or two of the three effects needs to be tested. If the effects of 
one or two of the three factors are null, the full model will statistically overfit 
the data and produce biased and inaccurate estimates. In addition, the full 
model has the model identification problem. The results shown in models 1 
to 5 reflect precisely this problem. And, different linear constraints used to 
estimate the full model are bound to produce different estimates that are 
inaccurate in different degrees. Unlike in the simulation exercise, analysts 
cannot know which underlying effects are present and which are not, given 
only observed data. 

One way to select among alternative models is to conduct model fit tests 
of whether all three of the A, P, and C effects are present and should be 
simultaneously estimated (see, e.g., Mason and Smith 1985). That is, ana- 
lysts should successively estimate model with the A, P, C, AP, AC, PC, and 
APC sets of effect coefficients and examine the corresponding model fit 
statistics for improvement as additional sets and combinations of coeffi- 
cients are added. This gives a sense of the relative importance of A, D, and 
C effects and the best model that summarizes the trends in the observed 
data. Accordingly, for the data of Table 5.4, we estimated nested models 
and computed model selection statistics—log likelihoods and Bayesian 
information criteria statistics (which are log likelihoods penalized for 
numbers of parameters estimated; see Raftery 1986, 1995)—for the three 
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TABLE 5.6 
Model Fit Statistics for Data in Table 5.4 

For Data in Table 5.4a 
Models Log-Likelihood DF BIC 
A 121.8 6 —107.5 
P —128.3 6 2516.4 
C —115.8 11 1222.5 
AP 123.1 11 —89.6 
AC 132.6 16 —71.7 
PC 132.6 16 —71.7 
APC 122.6 20 —57.3 

For Data in Table 5.4b 
Models Log-Likelihood DF BIC 
A —128.3 6 2517.6 
P 119.9 6 —107.5 
C —115.8 11 1223.5 
AP 121.4 11 —89.6 
AC 125.4 16 —71.7 
PC 129.8 16 —71.7 
APC 131.5 20 -57.3 

For Data in Table 5.4c 
Models Log-Likelihood DF BIC 
A -128.3 6 2518.3 
P -128.3 6 2516.9 
C 126.1 11 —89.6 
AP 125.4 11 —89.6 
AC 133.5 16 —71.7 
PC 129.4 16 —71.7 
APC 124.4 20 —57.3 


Source: Data Modified from Norval Glenn, Cohort 
Analysis, 2nd ed. Thousand Oaks, CA: Sage 
Publications. 

Note: Model fit statistics, BIC (Bayesian Information 
Criterion), are calculated by Stata GLM; the 
smaller the AIC and BIC, the better the model 
fit. The best fitting models for each dataset 
are highlighted in bold. 


sets of data. Applying the usual criterion of selecting models with the 
smallest values of these statistics, the results shown in Table 5.6 suggest 
that the best-fitting models for data sets one, two, and three are A effects 
only, P effects only, and C effects only or AP effects models, respectively. 
Because the full APC models are not the preferred models, the discussion 


APC Accounting/Multiple Classification Model, Part I 109 


of which identifying constraint gives the correct estimates is not pro- 
ductive and can be avoided given the findings from the model selec- 
tion analysis. 

Every statistical model has its limits and will break down under some con- 
ditions. We have found in additional simulation analysis one such condition 
in which the IE produces larger bias in small samples when the underlying 
effects are zero than when they are not zero. We also have shown, by anal- 
yses of Glenn's (2005) numerical example, that researchers should conduct 
careful model selection tests before using full APC models. We reiterate that 
imposition of a full APC model on data when a reduced model fits the data equally 
well or better constitutes a model misspecification and should be avoided. On the 
other hand, when model selection tests indicate that all three of the A, P, and 
C dimensions are operative in producing a given set of data, application of 
the IE may be quite useful in producing meaningful and stable estimates. 
And, MC simulation analysis is one important avenue for model validation. 
In conclusion, the IE performs well as a statistical estimator under most con- 
ditions. The results of the simulation studies can inform empirical studies 
that use the IE for statistical estimation of linear APC models. 


5.6 Interpretation and Use of the Intrinsic Estimator 


Given the desirable properties the IE possesses as a statistical estimator 
and its evident ability to produce valid estimates of A, P, and C effect coef- 
ficients from an underlying generating model, the question becomes one 
of how to interpret and use this estimator. The question of interpretation 
arises because the identifying constraint imposed by the IE on the uniden- 
tified APC accounting model parameter vector b—namely, projection onto 
the nonnull (column) space of the design matrix X—appears to be a con- 
straint of purely algebraic convenience, devoid of substantive meaning. By 
contrast, conventional equality-constrained estimators of APC accounting 
models often are motivated by substantive hypotheses derived from theory 
or prior studies or side information about a process under study that indi- 
cate that certain coefficients are, say, of the same magnitude and hence can 
be constrained to be equal. Thus, the question becomes: How can the IE and 


` An extreme example of patterns of effects of the A, P, and C temporal dimensions is the case 
in which the effect coefficients of two of the dimensions change linearly across their respec- 
tive domains. In this case, one of the linear trends can be written as an exact algebraic linear 
transformation of the other. Accordingly, a reduced model with fewer than three dimensions 
is sufficient to model the data and the estimation of the full model with the IE will result in 
biased and inaccurate estimates of the effect coefficients. APC accounting model applications 
to real empirical data rarely would exhibit such an extreme linear degeneracy, but nonethe- 
less can produce approximations thereto. 
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the Moore-Penrose generalized inverse matrix be used to statistically evalu- 
ate equality-constrained APC accounting models in which the identifying 
equality constraint is based on prior substantive theory or empirical research 
findings? Recall the property of estimable constraints stated in Section 5.5 to 
the effect that if the equality constraints are, in fact, valid, the constraints 
produce an estimable function of the unconstrained coefficient vector b and 
produce CGLIM estimates of APC effect coefficients that will be numerically 
close to those estimated by the IE. This property provides the key to how 
the IE can be used to evaluate substantively motivated equality constraints. 

To focus the discussion, consider the effect coefficient estimates reported 
in Table 5.7 for the cancer mortality data for U.S. females, 1969-2007 (com- 
putational codes available online). The first two columns of the table report 
the coefficient estimates, standard errors (SEs), model deviance, and overdis- 
persion coefficient for the IE applied to these data.’ The next two columns 
report the corresponding estimates for a CGLIM model, which (for reasons 
that will be made clear) we label CGLIM 3, wherein the coefficients for the 
respective first categories of the A, P, and C groups are taken as the refer- 
ence categories and have effects set to zero, and the identifying constraint 
is that the second birth cohort (C2) is constrained to have the same effect 
coefficient (zero) as the first cohort (C1). The bottom rows give the overall 
model fit statistics for these two models. Previously, we noted that all just- 
identified models that incorporate effect coefficients for the full array of A, 
P, and C categories will fit the data equally well. This is evident in the model 
fit statistics here. Thus, to reiterate the point made previously, one cannot 
use fit statistics to discriminate among just-identified models. Rather, some 
other criterion must be employed. The criterion applied here is that the con- 
strained vector must be estimable. 

The numerical estimates in Table 5.7 show that the constraint imposed 
in the CGLIM 3 model (the effect of the 1885 cohort constrained to equal 
that of the 1890 cohort) is statistically valid (i.e., within sampling variability) 
for these data—the estimated effect for the 1885 cohort in the IE column is 
.69, and that for the 1890 cohort is .66 with the SE of the former being .08 
and latter being .04. Thus, the difference of these two effect coefficient esti- 
mates, .04, is well within sampling error and therefore effectively zero. In 
other words, this equality constraint produces an estimable function in a 
statistical sense (to be made precise in the following discussion). The con- 
sequence is that the coefficient vectors for these two models are statistically 
identical up to a centering or normalizing transformation. This equivalence 
is demonstrated numerically in the last column of Table 5.7, which gives the 
numerical values of the corresponding recentered/renormalized CGLIM 3 


* For both the CGLIM and IE models, the Poisson variance function is assumed to have a mul- 
tiplicative overdispersion factor that can be estimated by dividing the deviance by degrees of 
freedom (DF). And, the overdispersion coefficients were estimated using the quasi-likelihood 
method (McCullagh and Nelder 1989). 
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TABLE 5.7 
IE and CGLIM Estimates, U.S. Female Cancer Mortality, 1969-2007 
Centered Centered Centered 
Intrinsic CGLIM 3 (C1 = CGLIM1 CGLIM2 CGLIM3 
Estimator (A1=A2) (P1-P2  (C1-C2) 
Effect S.E. Effect S.E. Effect Effect Effect 
Intercept —5.39 0.01 -7.74 1.23 -5.39 —5.39 —5.39 
Age 
20-24 —2.21 0.04 0.00 0.00 1.05 —2.73 —2.45 
25-29 -1.70 0.03 0.54 0.11 1.05 -2.15 -1.91 
30-34 -1.25 0.02 1.03 0.21 1.00 -1.62 —1.42 
35-39 —0.80 0.02 1.52 0.31 0.95 —1.09 -0.93 
40-44 0.36 0.02 2.00 0.41 0.90 —-0.56 —-0.45 
45-49 0.01 0.01 241 0.51 0.77 —0.11 —0.04 
50-54 0.28 0.01 2.71 0.61 0.53 0.23 0.26 
55-59 0.50 0.01 2.97 0.71 0.25 0.54 0.52 
60-64 0.70 0.01 3.21 0.81 —0.05 0.83 0.76 
65-69 0.87 0.01 3.41 0.91 —0.39 1.07 0.96 
70-74 0.97 0.01 3.55 1.01 —0.79 1.25 1.10 
75-79 1.03 0.01 3.65 1.11 -1.23 1.39 1.20 
80-84 1.03 0.01 3.69 1.21 -1.73 1.47 1.23 
85+ 0.94 0.01 3.63 1.31 -2.32 1.47 1.18 
Period 
1969-74 —0.34 0.02 0.00 0.00 —2.10 —0.06 —0.21 
1975-79 —0.26 0.01 0.04 0.10 —1.51 —0.06 —0.16 
1980-84 —0.15 0.01 0.12 0.20 —0.90 —0.02 —0.09 
1985-89 0.00 0.01 0.22 0.30 —0.25 0.04 0.02 
1990-94 0.07 0.01 0.26 0.40 0.32 0.03 0.05 
1995-99 0.16 0.01 0.31 0.50 0.91 0.04 0.11 
2000-04 0.23 0.01 0.34 0.60 1.48 0.03 0.14 
2005-07 0.28 0.01 0.36 0.70 2.04 0.00 0.15 
Cohort 
1885 0.69 0.08 0.00 0.00 5.71 —0.12 0.32 
1890 0.66 0.04 0.00 0.00 5.17 —0.08 0.32 
1895 0.56 0.03 —0.06 0.13 4.57 —0.09 0.26 
1900 0.47 0.02 —0.11 0.22 3.98 —0.10 0.20 
1905 0.39 0.02 —0.15 0.32 3.40 —0.09 0.17 
1910 0.34 0.02 —0.16 0.42 2.85 —0.06 0.15 
1915 0.29 0.01 —0.18 0.52 2.30 —0.04 0.14 
1920 0.22 0.01 —0.21 0.62 1.73 —0.02 0.11 
1925 0.16 0.01 —0.23 0.72 1.17 0.00 0.09 


continued 
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TABLE 5.7 (continued) 
IE and CGLIM Estimates, U.S. Female Cancer Mortality, 1969-2007 


Centered Centered Centered 


Intrinsic CGLIM 3(C1= CGLIM1 CGLIM2 CGLIM3 
Estimator C2) (A1=A2) (P1=P2) (C1=C2) 
Effect S.E. Effect S.E. Effect Effect Effect 

1930 0.08 0.01 -0.28 0.83 0.58 -0.01 0.04 
1935 0.00 0.01 -0.31 0.93 0.00 0.00 0.00 
1940 -0.08 0.02 -0.35 1.03 -0.58 0.01 —0.04 
1945 -0.16 0.02 -0.40 1.13 -1.17 0.00 —0.09 
1950 -0.27 0.02 —-0.47 1.23 -1.77 —0.02 -0.15 
1955 -0.34 0.02 —0.50 1.33 -2.34 —0.01 —0.19 
1960 —0.40 0.02 -0.52 1.43 -2.90 0.01 -0.21 
1965 -0.45 0.02 -0.54 1.58 -3.46 0.04 -0.22 
1970 -0.48 0.03 -0.53 1.63 -3.99 0.09 -0.21 
1975 -0.52 0.03 -0.54 1.74 -4.58 0.13 -0.22 
1980 —0.60 0.05 —0.58 1.84 -5.11 0.13 -0.26 
1985 -0.58 0.09 -0.52 1.94 —5.60 0.23 -0.21 
t ratio 9.59 -3.26 —0.37 
p-value «0.001 0.001 0.712 
Deviance 1825.3 1825.3 18253 1825.3 1825.3 
DF 72 72 72 72 72 
Overdispersion 254 25.4 25.4 25.4 25.4 


effect coefficients, that is, the CGLIM 3 APC coefficients transformed by sub- 
tracting their respective group means so that the transformed coefficients 
sum to zero.’ The resulting recentered CGLIM 3 and IE effect coefficients 
generally agree. The patterns of the respective groups of A and P effects are 
more similar than those of C effects, which reflects the impact of the cohort 
equality constraint used by CGLIMS3. 


` CGLIM coefficients can be transformed as follows to the normalization that A, P, C effects each 
sum to zero: Under the original CGLIM normalization, let a; be the estimated A effect for age i, 
let p; be the estimated P effect for period j, and let c, be the estimated C effect for cohort k. 
Let d be the estimated intercept. Transforming these estimates to a different normalization 
means that we want to find new coefficients a;, pj, c; and d’ such that (1) the predicted values 
for each data point do not change, and (2) the A, P, and C effects each sum to zero. The solu- 
tion is to subtract the mean of the original A effects from each a; to obtain a new A effect aj 
(this guarantees that the new A effects sum to zero) but then add the mean of the original A 
effects to the intercept, which guarantees that the predicted values do not change. The same 
process can be used to transform the P and C effects. One can compute the corresponding 
standard errors using the variance of the new or transformed design matrix, X**v = LX, where 
L is the normalizing matrix that transforms the X into the form introduced previously. That 
is, the standard errors for X'*v = sqrt(diag(cov(X^**)), where cov(X**v) = Lcoo(X)L! is the new 
variance-covariance matrix. As stated elsewhere in the chapter, such linear transformation 
only makes the coefficients more comparable across different methods of estimation in this 
methodological exposition and does not change the substantive findings of A, P, and C trends. 
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Other possible identifying assumptions do not fare so well. To illustrate 
this, Table 5.7 reports comparable effect coefficient estimates for two alterna- 
tive CGLIM models: CGLIM 1, which identifies the model by constraining 
the effect coefficient for the group aged 25-29 (A2) to be the same as that of 
the group aged 20-24 (A1); and CGLIM 2, which achieves identification by 
constraining the effect coefficient for the 1975-79 time period (P2) to equal 
that for the 1969-74 period (P1). Comparing all three sets of CGLIM coeffi- 
cients, it can be seen that those produced by the CGLIM 1 and CGLIM 2 mod- 
els bracket those produced by the CGLIM 3 model; that is, the two alternative 
CGLIM models yield effect coefficients that diverge substantially from those 
given by the CGLIM 3 model. The divergences are substantial and dramatic. 
The reason for this behavior is that the equality constraints used to produce 
the CGLIM 1 and CGLIM 2 models do not produce statistically estimable 
functions and corresponding coefficient estimates. 

To proceed more systematically, a method is needed for assessing whether 
two estimated coefficient vectors are within sampling error of being equal. 
There are several ways to accomplish this. We have compared centered 
CGLIM effect coefficients element by element with the corresponding esti- 
mated IE effect coefficients to determine whether the former are within, say, 
two standard errors of the latter. Using the estimated standard errors of the IE 
coefficients, it can be seen that most of the centered effect coefficients for the 
CGLIM 3 model are within two standard errors of the IE coefficients. By com- 
parison, this is not the case for the centered effect coefficients of the CGLIM 1 
and CGLIM 2 models. An alternative procedure is to define a test statistic 
based on the entire vector of coefficients. To do so, we state the null hypothesis: 


Ho: E! Bj) = (B+ $By)" By = s= 0 (5.10) 


In words, this null hypothesis is that the expected value of the product of 
the estimated and re-normalized CGLIM vector b and the eigenvector B, that 
is fixed by the design matrix is zero. Due to the orthogonality of the vectors B 
and B,, this, as Equation (5.10) indicates, is equivalent to the hypothesis that 
the expected valued of the scalar s is equal to zero. Using the geometric pro- 
jection illustrated in Figure 5.1, this test is equivalent to testing whether the 
estimated parameter vector under a given set of constraints (b, or b, or Ds) 
lies significantly far away from the estimable function b, so that one can 
infer that its horizontal projection results from a nonzero s. 

To specify a test for this null hypothesis, we build on a well-known asymp- 
totic distribution property of the maximum likelihood estimator (MLE) 
used to estimate b: Under broad regularity conditions, as sample size or the 
number of time periods of data increases, the MLE of b is consistent and 
asymptotically normally distributed (McCulloch and Searle 2001: 306). This 
property facilitates the definition of an asymptotic f test for the null hypoth- 
esis (15) as 
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s—0 S 
See e (5.11) 


where se(s) denotes the estimated (asymptotic) standard error of the scalar s. 
To obtain this test statistic, note that the numerator s can be computed, as 
indicated in Equation (5.10), by calculating the product of the vectors b and 
Bo. Then, the denominator can be computed by transforming the asymptotic 
variance-covariance matrix $- De, the inverse of the Fisher information 
matrix) that is obtained in the process of estimating b by maximum likeli- 
hood to obtain b. Because the vector B, is orthogonal to the design matrix 
[see Equation (5.2)], the variance of the scalar s can be computed by impo- 
sition of the usual quadratic form transformation applied to obtain the 
variance of the restricted MLE from the MLE, namely, B/£ B; Taking the 
square root of this transformation yields an estimate of the standard devia- 
tion of s = b’By, which, when divided by the degrees of freedom of the model, 
df = ap — (1+ (a — 1) + (p - 1) + (a + p - 2), produces an estimate of the standard 
error of s in the denominator of Equation (5.11). 

Applied to the three alternative CGLIM models for which the renormal- 
ized coefficients are given in Table 5.7, we obtain the following t ratios: 9.59 
(p < 0.001) for CGLIM 1, -3.26 (p = 0.001) for CGLIM 2, and —0.37 (p = 0.712) 
for CGLIM 3. The results show that only the CGLIM 3 models estimates are 
not statistically different from those of the IE, but the CGLIM1 and CGLIM 2 
model estimates are. Reiterating the point made above, the results of apply- 
ing the asymptotic f test to the CGLIM1, CGIIM2, and CGLIMS models show 
that estimators that incorporate significant components of the B, vector (and 
thus the design matrix) may give misleading results concerning the pat- 
terns of change of the estimated effect coefficients across the age, period, and 
cohort categories. Significant f ratios indicate statistically significant depar- 
tures from the IE coefficients and patterns of change that depart substantially 
from those of the IE. The much larger t ratio produced by CGLIM1 means 
that equating age effects is a particularly inadequate constraint to use. This 
statistical result is perfectly consistent with the strong age gradient in cancer 
mortality rates documented in biodemographic and epidemiologic studies. 

In brief, the foregoing test for statistical estimability leads to new avenues 
for using and interpreting the IE as applied to a particular dataset. On the 
one hand, an analyst can apply the IE in an exploratory data analysis manner, in 


* Formally, the asymptotic normality property of the MLE yields b “Y N(b,Y-)), that is, the sam- 
pling distribution of the estimated coefficient vector of a model thatis identified by imposition 
of a theoretically motivated equality constraint on two or more coefficients is asymptotically 
multivariate normal with a mean (expected) parameter vector b and a variance-covariance 
matrix >, where X3 is estimated by Š, the inverse of the Fisher information matrix. By prop- 
erties of the MLE (see, e.g. McCulloch and Searle 2001: 309), a linear transformation of b 
using B, then yields MB, ^" N(BT2B,), where WE = s. From this, the asymptotic standard 
error of s can be computed as the square root of B-B, divided by degrees of freedom as 
indicated in the text. 
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which the objective is to ascertain good estimates of the patterns of A, D, and 
C effects in a table or set of tables of rates.’ In such an exercise, the analyst does 
not approach the data with strong prior notions about particular patterns of 
effects that should be evident in the data, but rather seeks to let the intrinsic 
patterns emerge from application of the IE, taking advantage of the fact that 
the IE is an estimable function and thus has desirable statistical properties. 

On the other hand, using the definition of statistical estimability and the 
test described, an analyst can use the IE in a more confirmatory data analysis 
manner, in which a vector of effect coefficients estimated from application 
of the IE is used as a benchmark to assess whether a corresponding vec- 
tor of coefficients estimated from the imposition of one or more theoreti- 
cally or substantively motivated constraints to achieve model identification 
is acceptable. In this type of exercise, an analyst approaches a table or tables 
of rates with a definite hypothesis or set of hypotheses about the underlying 
A, P, and C effects that generated the data. The analyst can use the vector 
of effect coefficients estimated by the IE to assess the empirical plausibility of 
the hypotheses. In this way, the definition of statistical estimability and the 
test described directly address a criticism that often has been lodged against 
general-purpose methods of APC analysis, namely, that they provide no ave- 
nue for testing specific substantive hypotheses and thus are mere devices of 
algebraic convenience that may be misleading. 


Appendix 5.1: Proof of Unbiasedness of the IE as an Estimator 


of the b, = P,,,,;b Constrained APC Coefficient Vector 


This proof is for the linear model form of the APC accounting model. The 
same results apply, however, for generalized linear models through the 
Fisher information matrix as the linear part of the models remains the same. 

We begin by citing the PCR formulation of the IE noted previously as 
one method of estimation of the IE. Denote by m the number of eigenvalues 
(including the zero eigenvalue) of the matrix X7X in the solution to normal 
equations (4.7) in Chapter 4 and denote by r the rank of the XTX matrix. A 
general property of the principal components estimator (Sen and Srivastava 
1990: 256) in the linear model is that the bias in the principal components 
estimator of the regression coefficient vector B induced by the deletion of 
m -r variables in the regression model (corresponding to eigenvalues equal 
to zero or nearly equal to zero) is O,, dn, In this expression, Ou denotes 
a (m — r) x (m — r) matrix, the columns of which are the eigenvectors corre- 
sponding to the m -r eigenvalues that are zero or near zero, and D. denotes 


* The distinction between "exploratory" and “confirmatory” data analysis dates in statistics at 
least to the classic work of Mosteller and Tukey (1977). 
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the regression coefficients of these eigenvectors. In the present case of the 
APC regression model (4.5) in Chapter 4, m — r = 1, Qm- = By and Boy» = 
s. Since s = 0 in the IE, the unbiased property applies here. Furthermore, 
since an orthonormal linear transformation of an unbiased estimator (which, 
as described in the text, is used in the principal components approach to 
estimation to transform the PCR estimator to the special parameterization 
or linear function b, of the parameter vector b that is estimated by the IE) 
remains unbiased, it follows that the IE estimate obtained by this algorithm 
is unbiased. 


EÁ G 


Appendix 5.2: Proof of Relative Efficiency of the IE as an 


Estimator of the b, = P,,,,;b Constrained APC Coefficient Vector 


We prove the theorem only for linear models as the same results can be 
derived for generalized linear models through the Fisher information matrix. 
Recall that X denotes the design matrix of the APC multiple classification 
model (4.5) in Chapter 4. Let x denote the row vector with elements -1, 0, or 1 
corresponding to the identifying constraint on the parameters in B necessary 
to achieve a unique estimator. Let W denote the augmented design matrix 
in which X has been expanded by the constraint vector x, so that we can 
write WT = [XT |x!]. Thus, WTW is invertible, and by linear model theory, the 
variance-covariance matrix for the constraint estimator is var(b) = o?( WTW)-, 
where c? denotes the variance of the model random error term. 

Let U be the orthonormal matrix such that the matrix XTX is diagonalized, 
that is, XTX = UAUT, where A = diag [M, ..., Minar Am] With A, 2 ... 2 An > 0 and 
Am = 0 being the eigenvalues of matrix XTX. Let A, = diag [Ay, ..., An AL Since 
the IE is a special principal component estimator with the principal compo- 
nents A, >... 2 44, , > 0, by linear model theory 


-1 
var(B) = c?uA?u'? = eu[^ e (5.12) 


where AG denotes the generalized inverse. Note also that WTW = XTX + xx = 
U(A + ZIzZ)UT, where z = xU. , 

By the principal component decomposition of the CGLIM estimator b = 
B+B,, B and B, are orthogonal in the parameter space. Since A, corresponds 
to the variance of B, Z'z corresponds to the variance of B,. Therefore, letting 
z'z = diag [0, ..., 0, c] with a constant c > 0, we have 


An 


var(b) - var(B) = o? um = d Ab | =o 613) 


APC Accounting/Multiple Classification Model, Part I 117 


is positive definite for nontrivial constraint z with c > 0, where U, is the 
mth column vector of matrix U, which is the eigenvector of matrix X1X with 
eigenvalue 0. 


Appendix 5.3: IE as a Minimum Norm Quadratic Unbiased 


Estimator of the b, = P,,,,,b Constrained APC Coefficient Vector 


In the context of the estimation of the b, = P,,;b constrained coefficient vector 
of the APC accounting/multiple classification model for a fixed number of time 
periods p of data, the unbiasedness and relative efficiency of the IE can be com- 


bined with the numerical properties of the IE in the following two properties. 


Property 1: The IE is a well-defined numerical solution to problem of matrix inver- 
sion in the rank deficient linear equation system defined by the APC accounting/ 
multiple classification model. As noted, for example, by Girosi and King (2008: 
237) in discussing the Moore-Penrose generalized inverse solution to rank 
deficient linear systems: “There is always a well-defined solution that lies 
in [the nonnull space of a rank deficient matrix A], that is, a solution whose 
projection on the null space is zero. We take this as the ‘representative’ solu- 
tion.” This is the solution that defines the IE. 

It follows from this property that any solution of the rank deficient linear 
equation system of the APC accounting model that incorporates elements of 
the null space of the system is not a well-defined numerical solution. Does 
this mean that no estimator of the APC accounting model coefficient vec- 
tor b obtained by placing an equality constraint on two coefficients is a well- 
defined solution to the system of equations defined by the model? No. The 
reason is that such an estimated coefficient vector can be estimable in a sta- 
tistical sense; see the definition of statistical estimability in Section 5.6. 


Property 2: The IE is a MINQUE of the projection by = Proj b of the unconstrained 
APC multiple accounting model coefficient vector b onto the nonnull space of the 
design matrix X. The MINQUE method of statistical estimation was proposed 
and developed by the esteemed statistician C. R. Rao (1970, 1971, 1972; see 
also Jiang 2007: 28-29) in a series of papers published at about the time the 
Mason et al. (1973) article articulated the APC accounting/multiple classifica- 
tion model and its identification problem. Concerning the IE as a MINQUE, 
note that we have proven in Appendices 5.1 and 5.2 that for a fixed number 
of time periods of data, the IE is unbiased and has minimum variance as an 
estimator of the coefficient vector bo. The definition of the variance of estima- 
tors is equivalent to choosing a quadratic norm in the underlying Euclidean 
vector space. And, as shown by Girosi and King (2008: 238), “The solution 
[to the rank deficient system of linear equations Ax = y] whose projection in the 
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null space of A is 0 and the solution of minimum [quadratic] norm coincide." 
Since the projection of the IE onto the null space of the design matrix X in 
the APC accounting/multiple classification model is 0, it follows that it also 
is a MINQUE estimator. 


Appendix 5.4: Interpreting the Intrinsic Estimator, Its 
Relationship to Other Constrained Estimators in APC 
Accounting Models, and Limits on Its Empirical Applicability 


Recently, O'Brien (2011a) discussed constrained estimators in the APC 
accounting model and the IE in particular. Fu, Land, and Yang (2011) simi- 
larly discussed these topics and commented on the work of O’Brien (20112), to 
which O'Brien (2011b) responded. Because of the relevance of this exchange 
to the contents of this chapter, we summarize and comment on these articles. 

O'Brien (2011a) showed algebraically how each just-identifying equality 
constraint on the coefficient vector of the APC accounting model is asso- 
ciated with a specific generalized inverse matrix that produces the least 
squares estimator of the resulting constrained coefficient vector. He also 
discussed the geometry of constrained estimators in terms of solutions to 
the unidentified equation system corresponding to various constraints and 
generalized inverses being orthogonal to constraints, solutions to various 
constraints all lying on a single line in multidimensional space, the distance 
on that line between various solutions, and the crucial role of the null vec- 
tor. This then is used to characterize properties of all constrained estimators 
and those that are unique to the IE. Among other things, O'Brien (2011a) 
showed that the identifying constraint used by the IE, which results in the 
Moore-Penrose generalized inverse solution and the various desirable sta- 
tistical properties cited in this chapter and the appendices, produces the 
^most representative" among all possible solutions corresponding to all pos- 
sible just-identifying equality constraints. As Fu, Land, and Yang (2011) and 
O'Brien (2011b) pointed out, these are useful illustrations and derivations of a 
number of properties of all constrained estimators and the IE on which there 
is much agreement. O'Brien (2011b) also identified some disagreements, on 
which we now comment. 

First, as noted in Section 5.3.1, many of the articles on the identification 
and estimation problem published in the 1970s and 1980s were based on the 
presumption that the objective was to estimate the "true" or "generating" 
parameters, that is, those in the unidentified and unconstrained regression 
coefficient vector of the APC multiple classification model that generated the 
observed table of rates or proportions. This objective also motivated O'Brien 
(2011a: 435): "Ihe crucial question, from our perspective, is whether the 
parameter estimates are unbiased in the sense that their expected values 
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equal the values of the parameters that generated Y.” As pointed out by Fu, 
Land, and Yang (2011), however, data on an outcome variable from a regres- 
sion model with linearly dependent covariates can be generated by multiple 
sets of generating parameters. Algebraically, in the case of the APC account- 
ing model, this corresponds to the existence of an infinite set of generalized 
inverses that solve the normal equations for the unconstrained coefficient 
vector of the APC accounting model—geometrically illustrated by the “line 
of solutions" in O’Brien’s (2011a) figures. 

In face of this, as stated in the various articles by Yang, Fu, and Land and 
their associates, and as stated and emphasized in this chapter, the objective 
of the IE is not to estimate the unidentifiable regression coefficient vector 
but rather its projection to the nonnull space of the design matrix, an esti- 
mable function. As Tu, Kramer, and Lee (2012: 592) stated, the IE and related 
estimators are based on the mathematical relation within rows/columns in 
XTX or X'y, and “this relation is a natural consequence of the mathemati- 
cal relations among” the A, P, and C temporal dimensions in the classical 
APC accounting model. After this projected coefficient vector is estimated, it 
then can be used, as shown in Section 5.5, to statistically assess the probabil- 
ity that substantively motivated coefficient equality constraints result in an 
estimated APC regression coefficient vector that also is estimable. This then 
achieves the long-standing search, and that of O’Brien (2011a), for estimates 
of a vector of estimated effect coefficients of the APC accounting model that 
are based on prior research or theory. In addition, the properties of the IE 
and the Moore-Penrose generalized inverse matrix associated therewith that 
have been described in this chapter and appendices will more often than not 
produce useful and replicable empirical estimates of the trends across the 
age, period, and cohort temporal dimensions of the APC accounting model. 

Second, in Section 5.3.1, we showed that the identifying constraint of the 
IE satisfies the estimability condition derived by Kupper et al. (1985) for 
the APC accounting model and thus conclude that the IE is an estimable 
function of the unconstrained coefficient vector b. Yet, O’Brien (2011b: 468) 
stated that the IE “is not an estimable function in the traditional sense, since 
it does not meet the necessary and sufficient condition for an estimable func- 
tion described by Searle (1971: 185).” This statement disregarded the fact that 
the Kupper et al. estimability condition was derived by application of the tra- 
ditional definition and criteria for estimable functions to the APC accounting 
model. It also disregarded Searle’s (1971: 180) characterization of an estimable 
function as “a linear function of the parameters for which an estimator can 
be found from [any generalized inverse solution to the normal equations] 
that is invariant to whatever solution of the normal equations is used.” As the 
exposition in Section 5.3 clearly showed, the IE has this invariance property. 

Third, O’Brien (2011a: 440) was “skeptical” of the asymptotic properties of 
the IE described in Section 5.3.2 that imply that the expected value of any just- 
identified constrained estimator converges in value to the expected value of 
the IE as the number of time periods or age groups increases without bound. 
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In support of this skepticism, O'Brien (2011a) reported results of an analy- 
sis of a simulation of a specific numerical specification of an APC account- 
ing model. As stated by Fu, Land, and Yang (2011: 462), however, even this 
simulation illustrates the convergence properties proven mathematically in 
the articles cited in Section 5.3.2. Note also that the numerical simulations 
reported in Section 5.5 exhibited these asymptotic convergence properties. 
In addition, Fu, Land, and Yang (2011: 463) noted that the nonconstancy in 
the scalar coefficient that O'Brien (2011a) found to vary with differing num- 
bers of time periods of data is due to a different parameterization from that 
used to mathematically prove the asymptotic properties of the IE. 

In the end, as emphasized in this chapter and elsewhere in this book, the 
utility of the constraint imposed on the APC accounting model by the IE 
and the corresponding Moore-Penrose generalized inverse matrix solution 
of the deficient normal equations of the model will be determined by their 
ability to meet Glenn's (2005) criteria as cited in Section 52, that is, to yield 
approximately correct estimates more often than not in empirical applica- 
tions. As concerns this criterion, note again the empirical analyses reported 
in Sections 5.4 and 5.5 and in Chapter 8 that show similarities of inferences 
regarding the temporal patterns of age, period, and cohort effect when those 
estimated by the IE are compared with those estimated with the very dif- 
ferent HAPC model that is the subject of Chapters 7 and 8. In addition, 
using data on U.S. homicide arrest rates, Fu, Land, and Yang (2011: 461-462) 
reported estimates of the temporal trends of the age, period, and cohort 
effect coefficients by the IE that are quite consistent with those estimated 
by O’Brien (2000) using an APC characteristics (APCC) model analysis (the 
APCC model is a variant of the proxy variable approach to identification 
and estimation of the APC accounting model; see Section 4.4.4 of Chapter 4). 
In brief, the prior empirical findings regarding these temporal patterns by 
O'Brien using a proxy-variable-based APC model he previously developed 
are replicable by the IE. Again, however, we do not claim that the IE is a uni- 
versal solution to the APC accounting model identification and estimation 
problem. As emphasized in Section 5.5.2, every statistical model will break 
down under some circumstances, and the IE is no exception. 
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APC Accounting/Multiple Classification 
Model, Part II: Empirical Applications 


6.1 Introduction 


This chapter continues the discussion of the conventional age-period-cohort 
(APC) accounting model but focuses on empirical applications. We first 
describe a three-step procedure that combines descriptive and statistical 
APC analysis using the Intrinsic Estimator (IE) through studies of recent 
cancer incidence and mortality trends by sex and race in the United States. 

The utilities of linear models (LMs)/generalized linear models (GLMs) of 
APC extend beyond the identification and estimation of individual A, P, and 
C coefficients. Long-term trends estimated from such models also can be 
used to make projections/forecasts of the future. The special advantage of 
using APC models relative to conventional techniques for forecasting lies in 
the fact that they take into account cohort effects as an additional source of 
variation. We build on a previous study of APC model-based projection and 
forecasting (Osmond 1985) and illustrate the utility of linear APC models 
in forecasting future trends of mortality from four leading cancers. 


6.2 Recent U.S. Cancer Incidence and Mortality Trends 
by Sex and Race: A Three-Step Procedure 


We now turn to empirical analyses utilizing the new methodology of the 
IE to reveal the age, period, and cohort trends in cancer incidence and 
mortality since the launching of the SEER (Surveillance, Epidemiology, 
and End Results) Program by the National Cancer Institute (NCI) in 1973. 
Previous epidemiologic and demographic studies of trends in cancer mostly 
focused on period changes for specific age groups (e.g, Jemal et al. 2008; 
Manton, Akushevich, and Kravchenko 2009). Those that explicitly took cohort 
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variation into account used either descriptive analyses (e.g., Manton 2000) 
or conventional statistical methods of model identification that resulted in 
inconsistencies of findings due to the use of different identifying constraints 
and assumptions (e.g., Clayton and Schifflers 1987; Kupper et al. 1985; Jemal, 
Chu, and Tarone 2001; Tarone and Chu 2000; Tarone, Chu, and Gaudette 
1997). In general, there is a lack of guidelines for estimating linear APC mod- 
els of rates. 

We integrate descriptive and statistical analyses to more comprehensively 
assess the distinct impacts of age, period, and cohort on the incidence rates 
and mortality rates of 19 leading cancers for males and females. We imple- 
ment a three-step procedure that can be readily applied in other studies using 
the LMs. Step 1 is to conduct descriptive data analyses using graphics. The 
objective is to provide qualitative understanding of patterns of A, P, C, and 
two-way A-by-P and A-by-C variations. Step 2 is nested model fitting. The 
objective is to ascertain whether the data are sufficiently well described by 
any single factor or two-way combination of A, P, and C. If these analyses 
suggest that only one or two of the three effects are operative, the analysis 
can proceed with a reduced model that omits one or two groups of variables, 
and there is no identification problem. If, however, these analyses suggest 
that all three dimensions are at work, we proceed to Step 3—applying the IE 
to make statistical estimates of the net A, P, and C effects. We now illustrate 
these steps in analyses of cancer trends. 


6.2.1 Step 1: Descriptive Analysis Using Graphics 


The graphical presentations of cancer incidence and mortality data described 
in Chapter 3, Sections 3.3.2 and 3.3.3 are shown in Figure 6.1 for 20 cancer 
sites. We focus for now on the age-specific rates arrayed by four time periods 
corresponding to four decades: 1970s, 1980s, 1990s, and after 2000. We also 
analyzed cohort arrays of the age-specific rates but do not present them in 
the interest of space. As we noted in Chapter 4, the detection of cohort varia- 
tion can also be achieved by the comparison of age-specific rates across time 
periods. The results were consistent with similar analyses shown in the work 
of Manton, Akushevich, and Kravchenko (2009: Chapter 6), but extended 
them to include more leading cancers, other races in addition to white and 
black, and years after 2003. 

Total cancer incidence and death rates showed rapid increases after the age 
of 45 for both males and females of all three race/ethnicities. The incidence 
rates leveled off for males and decelerated for females in old age for most 
individual cancer sites, with a few exceptions (leukemia, melanoma of the 
skin, and uterine cervix). Total cancer mortality rates showed largely contin- 
uous increases throughout the life span. The same pattern was observed for 
half of the sites examined. For the other sites, old age mortality rates leveled 
off just as the incidence rates. 
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Age-specific rates of U.S. cancer incidence and mortality by time period. 
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Age-specific incidence and mortality rates for all cancers combined 
increased from the mid-1970s to 1990s and then decreased after 2000. Large 
increases in incidence rates over this period are evident for cancers of lung 
and bronchus, breast, prostate, non-Hodgkin lymphoma, esophagus, liver, 
kidney, brain and other nervous system, melanoma of the skin (for whites), 
and thyroid (for white males). Decreases in incidence rates over time were 
prominent for cancers of stomach, uterine cervix, and oral cavity and phar- 
ynx. Time trends of age-specific mortality rates for individual sites largely 
mirrored those of incidence rates. While most cancer sites showed slight fluc- 
tuations or continuous changes in age-specific mortality rates throughout 
the 40 years, increases were more pronounced in the last 10 years for cancers 
of lung and bronchus, esophagus, and thyroid (for males). 

Sex and race differences in age-specific rates of both incidence and mortal- 
ity were substantial. More males than females were diagnosed with cancers 
of nonreproductive sites and died from these cancers at most ages. One excep- 
tion was thyroid cancer, in which case females showed higher incidence and 
mortality rates. The incidence and death rates were higher for black males 
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than nonblack males for leading cancer sites at most ages. Males of other races 
had lower age-specific incidence and mortality rates for all sites except for 
cancer of liver and intrahepatic bile duct. White females showed higher age- 
specific incidence rates in cancers of breast, ovary, and uterine corpus than 
nonwhite females. But, black females had higher age-specific mortality rates 
of all cancers. There were also sex and race differences in the time trends of 
cancer incidence and mortality rates. For instance, time period increases in 
incidence and mortality rates were larger for females than males in lung can- 
cer, but larger for males than females in esophagus and liver cancers. Black 
and other females experienced increases in mortality from breast and uterine 
corpus cancers, in contrast to decreases for their white counterparts. In the 
case of liver cancer, blacks and other races also experienced larger increases 
in both incidence and mortality rates. On the other hand, blacks and others 
experienced larger declines in rates of esophagus cancer over this period at 
most ages and little increase in skin cancer, in contrast to the apparent uptakes 
of skin cancer incidence and mortality rates over the 40 years for whites. 

There is evidence of nonparallelism among age patterns by time period in 
rates of both incidence and mortality for most cancer sites, indicating cohort 
variations. And, such evidence is particularly strong for cancers of lung and 
bronchus, prostate, breast, esophagus, liver, kidney, melanoma of the skin, 
and thyroid. Therefore, it is likely that cohort effects are more prominent in 
these cancers than others. 


6.2.2 Step 2: Model Fit Comparisons 


The foregoing two-dimensional graphical analyses suggested some interest- 
ing patterns of the age changes and temporal dynamics of top cancer sites. 
Although they have implications for the A, P, and C effects that generate 
the observed data, they are descriptive measures that do not explicitly and 
effectively assess these effects. We already discussed their limitations in 
Chapter 4. We next employ statistical models to estimate these effects. The 
first question that needs to be addressed is which model best summarizes 
the data. We can compare measures of model fit, such as R squared for linear 
regression models (LMs) or deviance statistics and penalized functions for 
GLMs, to determine the relative importance of the A, P, and C dimensions. 
Table 6.1 exemplifies this model comparison process for fitting log-linear 
models to total cancer incidence and mortality rates. 

We first estimated three reduced log-linear models: a gross age (A) effects 
model and two two-factor models De, AP and AC effects models. We then 
estimated the full APC model in which all three factors are simultaneously 
controlled. We used the IE to estimate the full APC model. We have already 
noted that the specific identifying constraint chosen does not affect the fit 
for just-identified models. Goodness-of-fit statistics were calculated and used 
to select the best-fitting models for male and female mortality data. Because 
likelihood ratio tests tend to favor models with a larger number of parameters, 
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TABLE 6.1 


Goodness-of-Fit Statistics for Log-Linear Models of All-Site Cancer 
Incidence and Mortality Rates 


Incidence 
Male Female 
Models A AP AC APC A AP AC APC 
DF 126 119 102 96 126 119 102 96 
Deviance 103078 47503 11452 2643 75977 64163 6435 3277 
AIC 104813 49252 13235 4438 77696 65896 8202 5056 
BIC 104866 49326 13360 4581 77749 65970 8326 5199 
Mortality 
Male Female 
Models A AP AC APC A AP AC APC 
DF 126 119 102 96 126 119 102 96 
Deviance 21624 8450 17710 4751 7720 2954 3467 1881 
AIC 23199 10039 19333 6387 9304 4552 5099 3525 


BIC 23253 10114 19458 6529 9358 4626 5224 3668 


Note: Best models are indicated in bold. 


we calculated two most commonly used penalized-likelihood model selec- 
tion criteria, namely, Akaike’s information criterion (AIC) and Bayesian 
information criterion (BIC), which adjust the impact of model dimensions on 
model deviances. Because age is a fundamentally important source of varia- 
tion in morbidity and mortality, models with no A effects (P, C, and PC) have 
significantly had lack of fit and are omitted. For both incidence and mortality 
data on all cancer sites combined, both AIC and BIC statistics implied that the 
full APC models fit the data significantly better than any reduced models. 
The comparison of model fit using the BICs for individual cancer sites is 
summarized in Tables 6.2a and b Goodness-of-Fit Statistics for Log-Linear 
Models by Cancer Site, for incidence and mortality, respectively. We did not 
estimate models for thyroid cancer mortality for blacks and others due to 
the low death counts and potentially large sampling errors for these groups. 
While the full APC models best fit the data for a large portion of cancer sites, 
there were variations in best-fitting models by site, outcome, sex, and race. 
For incidence rates in males, the AP models are the best models for cancers 
of kidney and renal pelvis, pancreas, and myeloma, whereas the AC models 
were the best models for cancers of esophagus and brain and nervous sys- 
tem. Whereas the APC models fit the best for incidence of most cancer sites 
for white males, A and AP models were the better-fitting models for black 
and other males for most sites. Similar patterns held for females, with more 
variation by site. In particular, the APC models were the best-fitting models 
for only three sites in black females and no site in other females. Results for 
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mortality rates were largely consistent, with the APC models the best-fitting 
models for more sites and nonwhite groups. 

This was the first systematic analysis of the relative importance of temporal 
components in cancer incidence to our knowledge. It suggested that cohort 
effects are not only specific to cancer site and outcome but also strongly 
dependent on social demographic status. The cohort process of change is 
more prominent in understanding trends of cancer mortality than cancer 
incidence, more evident in males than females and for whites than blacks 
or others. This suggests that the development and diagnosis of cancer were 
more strongly related to the biological process of aging and period changes in 
diagnostic techniques, whereas survival from cancer was also strongly influ- 
enced by life course accumulation of exposures to risk factors for mortality. 
Sex and race variations in cohort effects further suggested the importance 
of the social stratification process. Sociological theories provide support for 
this finding in that the diffusions of health risk factors such as smoking and 
obesity and technological innovations are experienced earlier by the socially 
more advantaged groups (white men with higher socioeconomic status) than 
the less-advantaged groups (women and racial/ethnic minorities) (Pampel 
2005). Differential lengths of exposures at different points in the life course 
then likely are transformed into cohort differences that are more visible for 
the former than the latter. 


6.2.3 Step 3: IE Analysis 


We next examined the A, P, and C effects on cancer incidence and mortality 
rates based on the results of model comparison analysis using the best-fitting 
models for each site, outcome, and demographic group. We have already 
shown the numerical results from estimation of the full APC models using 
the IE method in Table 5.7 in Chapter 5 for female total cancer mortality 
rate. To facilitate the comparisons of the age and temporal effects across sub- 
populations and sites, we plotted the point IE estimates of effect coefficients 
by sex and race when the full APC models were the best-fitting models for 
the sex-race groups. The resulting graphs, shown in Figure 6.2, compare the 
coefficients of successive categories within A, P, and C classifications and 
show the net A, P, and C effects of incidence and mortality. Within each site 
and cancer outcome (incidence or mortality), we used the same scale so that 
the relative magnitude of A, P, and C effects can be easily discerned. The age 
and cohort ranges of these results varied by site, sex, and race because the 
counts of incidence and death at younger ages for more recent cohorts were 
too low to yield reliable estimates for some groups. A caveat for interpret- 
ing the sex- and race-specific results is that these are not absolute levels of 
mortality rates, but model effect coefficients. The former can be compared 
directly based on the same scale, such as deaths per 100,000 persons, but the 
latter only indicate relative trends within each population. The model-based 
results on A, P, and C effects largely accord with the observed patterns from 
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Intrinsic Estimator coefficient estimates of the age, period, and cohort effects on cancer inci- 
dence and mortality for selected cancer sites. 


descriptive displays of the data, but they also offer additional insights of and 
succinct summaries of distinct sources of incidence and mortality changes. 


6.2.3.1 All Cancer Sites Combined 


The IE analysis of the incidence and mortality changes of all cancer sites 
showed the dominance of A and C effects over P effects. The age curves of 
incidence rates showed exponential increases before old ages but decreases 
during old ages for most groups. Similar exponential increases occurred for 
cancer mortality rates at all ages. While there appeared some slight decel- 
erations in the rate of increase around the age of 50, there were no signs of 
declines in mortality rates for any group in old ages. The A effects showed 
more pronounced sex and race differences than the P or C effects. Specifically, 
females showed slightly less-steep age accelerations in incidence and mortal- 
ity rates than males, and black males showed steeper age accelerations after 
middle ages than nonblack males. 

While the A effects were highly consistent with age patterns of rates 
shown in the graphical analysis in Step 1, the results regarding distinct P 
and C effects were not immediately obvious in descriptive plots. There is 
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FIGURE 6.2 (continued) 
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an important distinction of the period changes observed from the Step 1 
graphical display of data and those based on APC models. That is, the period 
changes in the observed rates may be confounded with changes that are age 
or cohort related, whereas those from the model estimates represent true P 
effects that are net of A and C effects. When all three factors are considered 
in the APC models, there seem to be only modest P effects, but substan- 
tial C effects. The P effects showed log-linear trends, indicating increases 
in cancer incidence and mortality over the last 40 years. On the other hand, 
the cohort declines in cancer incidence were monotonic and continuous, and 
those in cancer mortality started later, reflecting a lagged effect of incidence 
on mortality. The successive cohort declines largely support the contention 
that nutrition, reduced exposure to carcinogens, and medical measures all 
contributed to increasing health capital in more recent cohorts that reduced 
the onset of malignancies and improved survival. 


6.2.3.2 Age Effects by Site 


The A, P, and C effects showed different patterns across sites, but the domi- 
nance of A effects was evident for most sites, suggesting the influence of can- 
cer biology on age patterns of both incidence and mortality. The incidence 
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rates of malignancy in most tissues followed the same age pattern of 
increases, with four notable deviations. The A effects of female breast cancer 
incidence were indicative of the etiology of the disease: The sharp increases 
in disease risk in early ages were more strongly genetically determined, and 
the increases in risk slowed at the age of menopause and plateaued with a 
peak around the ages of 65-70. The results for cervical cancer were similar, 
but the plateau appeared much earlier, starting around the age of 40. The 
A effects of esophageal cancer incidence showed modest increases between 
ages 35 and 55 and gradual declines afterward. The relative magnitude of 
the A effects, in this case, was similar to that of the P effects and smaller than 
the C effects. The A effects of the thyroid cancer incidence showed a bimodal 
pattern for white males, with the first peak in the 20s and the second higher 
peak around the age of 65 and a long plateau for white females after the 
age of 20. Mortality in nine sites showed continuously increasing A effects 
that were similar to those for all sites combined. These sites include prostate, 
colorectal, urinary bladder, stomach, leukemia, non-Hodgkin lymphoma, 
oral cavity, and melanoma of skin. Mortality in the remaining 11 sites showed 
a similar pattern with incidence data, that is, a concave pattern of A effects 
that increased from early adulthood to peaks in old ages and then leveled off. 
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6.2.3.3 Period Effects by Site 


The IE estimates of the P effects on cancer incidence showed increasing 
trends for all but one site. The period trend for cervical cancer incidence 
was clearly downward, reflecting the success of cancer screening through 
Pap smear in prevention and control of new cases of malignancies at this 
site. Period changes in incidence rates at other sites can be related to changes 
in exposures to carcinogens and behavioral risk factors. For example, the 
increases in lung cancer incidence can be due to increases in tar and nicotine 
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in cigarettes (particularly for females), drug use, and air pollution. The obe- 
sity epidemic can have widespread impacts on cancer sites that are linked 
to fat and glucose metabolism and reproductive hormones. And, period 
changes can also result from changes in the use of diagnostic and screening 
procedures. For instance, the increases in breast cancer incidence accompa- 
nied increasing use of mammography, which increased detection, whereas 
the slower increment in breast cancer incidence between 2000 and 2005 may 
have followed from reduced use of hormonal replacement therapy (HRT). 
The increases in the colorectal cancer incidence rates could have resulted 
from increased use of colonoscopy and detection of cancerous adenomatous 
polyps. Trends in the frequency of prostate-specific antigen (PSA) testing 
have also influenced prostate cancer incidence trends. The increasing inci- 
dence of several other cancers, such as melanoma of the skin, kidney, renal 
pelvis, and thyroid cancers, could all be related partly to diagnostic and 
imaging technology (Jemal et al. 2008). 

The IE estimates of P effects on cancer mortality were more varied by site 
and demographic group. While mortality in most sites showed increasing 
trends of P effects, mortality due to cancers of stomach and oral cavity (for 
whites and black males) changed little over time, and leukemia (for whites 
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and blacks) and cervical cancer mortality decreased over time. For lung 
cancer mortality, females had a steeper increase than males throughout 
the 40-year time period, which was consistent with the results on lung can- 
cer incidence. For breast, colorectal, liver, and oral cavity cancer mortality, 
females of other races showed steeper increases than the rest of the groups. 
For prostate, kidney, and myeloma cancer mortality, males of other races 
showed steeper increases than the other groups. For leukemia mortality, 
both males and females of other races showed increases, while whites and 
blacks showed decreases in period trends. In the cases of urinary bladder 
and esophageal cancer mortality, blacks had decreases, while other groups 
had increases in period trends. In the case of non-Hodgkin lymphoma mor- 
tality, black females also showed decreases over time. On the other hand, 
black males and females showed steeper period increases in skin cancer and 
thyroid cancer mortality than nonblacks, respectively. 

As shown in the Step 2 model comparison analysis, the C effects that sig- 
nificantly contributed to the overall temporal variation in incidence and 
mortality did not occur for all sites and groups. However, they accounted 
for changes in a substantial proportion of major sites for both sexes. The IE 
estimates of the C effects revealed myriad patterns of cohort changes that 
require a more detailed description. We discuss the results of cancer inci- 
dence and mortality separately in the following material. 


6.2.3.4 Cohort Effects on Cancer Incidence 


Pronounced cohort declines in rates of newly diagnosed malignancies were 
evident for a vast majority of sites. The shapes of the cohort curves of these 
sites, however, varied. The C effects of lung cancer incidence were consis- 
tent with the characteristic inverse-U shape that showed increases in earlier 
cohorts and decreases for subsequent cohorts (Gardner and Osmond 1984; 
Jemal, Chu, and Tarone 2001; Lee and Lin 1996; Tarone and Chu 2000). The 
increases and peaks of the C effects were delayed but more pronounced 
for females, especially black females. Such effects suggest the influence of 
gender-specific risk factor exposure and behavioral differences, especially 
cigarette smoking, in the life experience of specific birth cohorts. Studies 
have found diverse patterns of smoking across cohorts and the presence 
of sex differences, with recent increases in smoking in later cohorts of 
females (Harris 1983; Zang and Wynder 1996). Male cohorts adopted ciga- 
rettes in large numbers earlier than females, and the pattern preceded that 
of females by a decade or two (Lopez 1995). The decreases in incidence 
for subsequent cohorts may reflect the benefits of smoking cessation. The 
increases in the most recent three birth cohorts were unexpected and may 
be attributed to sparse data for these cohorts. In general, the model esti- 
mates for the earliest and most recent birth cohorts should be interpreted 
with caution because they were based on the few observations at the lower 
left and upper right corners of the age-by-period matrices, respectively, 


166 Age-Period-Cohort Analysis 


and were accompanied by larger standard errors than estimates for mid- 
dle cohotts. 

Cohort declines in incidence rates leveled off in more recent cohorts for 
four sites: colorectal, stomach, kidney, and uterine corpus cancers. Increases 
in more recent cohorts appeared for prostate, liver, and cervical cancers. 
The slowdowns or reversals of continuous cohort improvement seem to 
have initiated in the baby boomers born between 1946 and 1964, suggesting 
the need for better strategies of prevention and control that are specific to 
these and succeeding cohorts. On the other hand, incidence rates for mela- 
noma of the skin and thyroid cancer showed little changes for earlier cohorts 
and small declines for the baby boomer cohorts and successors. The cohort- 
related changes in these two sites, however, were modest compared to age 
and period-related changes and confined to whites. While the C effects for 
the majority of sites are relevant only for whites, especially white males, the 
C effects apply only to blacks in the case of esophageal cancer incidence. 


6.2.3.5 Cohort Effects on Cancer Mortality 


Cohortrelated changes in cancer mortality were prominent and consis- 
tent in trends for most sites and demographic groups. Site-specific cohort 
changes were remarkably similar to the cohort changes in all sites combined 
and showed largely continuous declines in mortality across successive birth 
cohorts. There were, however, sex and race differences in these trends within 
and across sites. The cohort changes in lung cancer mortality mirrored those 
in lung cancer incidence in terms of sex differences. That is, the lag between 
the peaking male and female birth cohorts can be attributed to the sex dif- 
ferences in stages of cigarette diffusion (Yang 2008). For cohorts born before 
1925, racial groups diverged in trends of change, with blacks showing more 
increases than whites and others. Compared to the lack of black and white 
differences in male incidence rates, this finding suggests the black disadvan- 
tage in lung cancer prognosis and survival in the earlier cohorts. The black 
disadvantages were pronounced for four additional sites. Specifically, while 
other groups showed cohort declines, black females showed continuous 
cohort increases in non-Hodgkin lymphoma and oral cavity cancer mortal- 
ity; black males showed cohort increases in urinary bladder mortality; and 
blacks also showed increases in esophageal cancer mortality for cohorts born 
before 1930. 

In some other cases, however, racial ethnic minorities showed more improve- 
ment across cohorts than whites. Black males and females experienced much 
stronger cohort declines in skin and thyroid cancer mortality, respectively. 
And the other races showed steeper cohort declines in prostate, breast, 
colorectal, leukemia, kidney, and oral cavity cancer mortality. The one site 
in which the other race group showed a clear disadvantage was liver cancer 
mortality: Females born between 1925 and 1945 experienced an increase. The 
differences discussed so far pertain to data for which the full APC models 
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are the best fit. It should be noted, then, that the C effects do not add to the 
model fit for the other race group for most sites or for the blacks for some sites. 
Therefore, cohort declines in cancer mortality from leading sites were gener- 
ally more prevalent for whites than blacks or the other races. 


6.2.4 Summary and Discussion of Findings 


Although U.S. cancer trends for the past few decades have been widely docu- 
mented, the sources of cancer incidence and mortality experiences attribut- 
able to distinct impacts of age, period, and cohort are not clear or understood. 
Findings from most previous studies using APC analysis were tentative due 
to limitations in data and analytic methods. And, evidence regarding the rel- 
ative importance of P and C effects in recent cancer trends was particularly 
scant. We used a set of comprehensive analyses, including a new method 
of estimation for the conventional GLM approach to APC analysis, to shed 
some new light on our progress during the 40 years of the war on cancer. 
These analyses improved previous studies in several regards. First, the new 
model estimation approach yielded results that can serve as objective criteria 
for selecting the best among alternative summaries of data and provided a 
test of whether the observed pattern revealed in descriptive analysis was 
real or random. Second, delineation of different sources of variations in inci- 
dence and death rates brought clarity to the specific components of cancer 
dynamics in the recent historical period and highlighted important period- 
and cohort-specific factors that are operational in the changes of the popula- 
tion risks of malignancies and improvement in survival. Third, the empirical 
assessment of cancer trends using the most recent data available may help 
to generate theoretical insights into continuing cancer mortality reductions 
and reveal previously unknown forces affecting health and mortality condi- 
tions in this country that likely will continue into the future. 

A major finding is the strong presence of cohort differences in cancer inci- 
dence and mortality from major sites. Whenever the C effects emerged, they 
tended to dominate the P effects in explaining recent cancer trends. This was 
particularly the case for cancer mortality. This finding largely supports the 
theory of technophysio evolution, which implies large cohort improvements 
in health capital and physiological capacities (Fogel 2004), and the “cohort 
morbidity phenotype” hypothesis, which emphasizes the long-lasting 
impacts on morbidity and mortality of reduced early life exposures to infec- 
tion and inflammation (Finch and Crimmins 2004). The effects associated 
with birth cohorts reflected processes of differential cohort accumulation 
of lifetime exposures to risk factors for cancer, such as education, diet and 
nutrition, physical activity, and smoking. If the finding on the C effects is 
valid, then the standard models of period changes such as those shown in 
descriptive analyses and Equation (4.8) in Chapter 4 are misspecified. The 
role of C effects in cancer trends and beyond undeniably has serious implica- 
tions for measurement and analysis. The lack of cohort declines in incidence 
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and stagnations of declines starting from the baby boomer cohorts for some 
cancer sites have important implications for future trends and are worthy of 
additional attention. 

It merits discussion that the P effects were generally small or modest when 
the C and A effects were simultaneously controlled. The model comparison 
analysis showed numerous cases for which there were virtually no P etfects 
(e.g., ovarian cancer incidence for white females and brain cancer mortal- 
ity for blacks). Most site-specific results showed monotonically increasing, 
albeit small, P effects that suggested log-linear trends. The finding of weak P 
effects relative to the C effects is an insight from the IE analysis that cannot 
be obtained from descriptive analyses that characterize most cancer trend 
studies (e.g., Manton, Akushevich, and Kravchenko 2009). This means that 
the anticipated cancer incidence and mortality declines have been mostly 
captured by or attributed to the C effects rather than P effects. This result is 
largely consistent with previous epidemiologic studies using APC models 
(e.g., Osmond 1985) and comprehensible in light of the different mechanisms 
underlying independent P and C effects. Because successive cohorts expe- 
rienced more favorable historical and social conditions, they not only had 
lower exposure levels to socioeconomic, behavioral, and environmental risk 
factors but also benefited from reduction in the exposure earlier in life course 
than previous cohorts. In this sense, mortality reductions as a consequence 
of advancements in medical measures—measures that, once introduced and 
found to be effective, tend to persist—manifested as cohort declines because 
the cumulative effects were more pronounced in successive cohorts com- 
pared to periods. 

The increasing P effects of cancer mortality are intriguing. They could be 
related to increases in cancer incidence rates that reflect actual increases in 
exposures to carcinogens and biobehavioral risk factors or expanded use of 
effective diagnostic techniques. Future research should further investigate 
the role of these and related factors in explaining this trend. An alternative 
explanation to the increasing P effects is a demographic one. That is, the P 
effects represent a special age-cohort interaction implied by the APC model- 
ing framework. It has been shown that recent cohorts, when exposed to new 
favorable epidemiologic conditions earlier in life, may experience greater 
delays of deaths to more advanced ages than cohorts already old at the time 
of exposure to these conditions (Guillot 2006). The postponement of propor- 
tions of deaths in succeeding cohorts to older ages may thus create a seem- 
ingly increasing period trend. Such a trend may result from the beneficial 
effects of early detection and treatment of cancer patients and not represent 
worsening conditions. There is evidence that the composition of death dis- 
tributions actually changed by cause. Data from an earlier study by Yang 
(2008) showed a 50% increase of proportions of lung cancer deaths and a 
42% increase of proportions of breast cancer deaths from the elderly popu- 
lation 65 and over in the United States between 1960 and 1999. Combining 
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the estimated P and C effects, it can be inferred that there were not only 
marked decreases in the level of cancer mortality across cohorts, but also 
postponement in the timing of cancer-related deaths across cohorts during 
this period. 

We conducted analyses on a comparative basis between different tumor 
sites and populations, such as two sexes and three race groups. Mortality 
declines across cohorts occurred for some groups and major sites that are 
leading causes of cancer deaths. But, the overall improvement was less 
impressive for blacks and other races. While the latter group had generally 
lower cancer incidence and mortality rates and less to improve on, blacks 
suffered substantial disadvantages in cancer survival. The results on the het- 
erogeneity in temporal dynamics can thus be used to identify populations 
with significantly higher or lower incidence of malignancy in various tissues 
so that the search for risk factors associated with the disease may be more 
selective and specific. 

In sum, the systematic study of sources of variations in cancer incidence 
and mortality has established patterns and phenomena not known before 
that require further explanations. It thus can be used as a triage for refined 
analysis of explanatory factors underlying period and cohort processes of 
change. Prominent factors include cigarette smoking, diet, sedentary life- 
styles and related obesity, among other debilitative behaviors and unfa- 
vorable environmental factors amenable for modifications. Taking these 
variables into account in APC analysis will further improve our understand- 
ing of the prospect of future cancer mortality reductions. How to accomplish 
this goal using a different family of models is shown in the next chapter. 


6.3 APC Model-Based Demographic Projection and Forecasting 


The methods and techniques introduced all pertain to the identification of 
individual coefficients of A, D, and C effects in an APC model. A different, 
but related, question is how the inclusion or exclusion of A, P, or C effects 
may change the overall fitted values using the model. We have shown in the 
Step 2 analysis that the nested model-fitting exercise is most helpful in the 
choice of best set of variables to describe the data. Building on the results 
of this analysis, one can further obtain the predicted values from the best- 
fitting models for the purpose of projection in the outcome of interest. We 
now extend the study of recent trends in cancer mortality to show how to 
use APC models for projection and forecasting of future trends. Before we 
describe the method and present empirical results, we first introduce the 
theoretical rationale for this approach to forecasting (Reither, Olshansky, 
and Yang 2011). 
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6.3.1 Two-Dimensional versus Three-Dimensional View 


The fact that both period and cohort effects are important in explaining the 
trends in cancer mortality motivates the use of APC models in the estimation 
of future rates of mortality because they take into account both effects and 
are likely to be more accurate than methods that ignore cohort changes. The 
advantage of this approach to forecasting can be best understood in the con- 
text of health and longevity. 

Many researchers have simply extrapolated the historical trend forward in 
time as the basis for conclusions about population health and prospect of life 
expectancy. We term this two-dimensional forecasts. They are derived from 
age-specific health statistics such as death rates from specific causes, which 
are plotted over a series of periods of observation. The only two variables 
in use are age and time period. These data are then extrapolated into the 
future via various methods. Commonly used metrics of population health 
such as life expectancy based on these mortality rates then provide an esti- 
mate of average duration of life under the stationary population assump- 
tion that a hypothetical birth cohort experiences the death rates observed 
at all ages in a given calendar year throughout their lives. Therefore, the 
implicit assumption underlying two-dimensional forecasting is that younger 
cohorts, still alive, will be identical to their predecessors as they age, and that 
future improvements in health and longevity, already experienced by older 
cohorts, will continue at an identical pace for younger cohorts throughout 
the remainder of their lives. 

By ignoring the third dimension of variation, namely, birth cohort varia- 
tion, the two-dimensional approach fails to account for the time lag between 
risk factors or improved conditions now present among younger cohorts and 
their likely influence on the health status of these cohorts as they age. In 
other words, although death rates observed today provide an accurate reflec- 
tion of a lifetime of past behavioral, environmental, and epigenetic factors 
accumulated by the recently deceased, these statistics provide a limited two- 
dimensional vision of the future because they fail to take into account the 
health conditions of people still alive—particularly children and adolescents. 
A more accurate gauge of the future of health and longevity requires a three- 
dimensional view that accounts for this time lag. 

If environmental and health conditions were constant or always improv- 
ing on a steady and predictable course, the underlying assumption in two- 
dimensional forecasts would be plausible. However, history shows that 
two-dimensional forecasts have been inaccurate—sometimes with dire pol- 
icy consequences. Under conditions of declining cohort mortality, they will 
underestimate actual life expectancy. For example, many pension plans have 
become strained or bankrupt because they failed to account for declines in 
smoking and improvements in lifestyles and medical technologies that led 
to improved survival among employees from these birth cohorts (Olshansky, 
Carnes, and Mandell 2009). Under conditions of rising mortality, the same 
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approach will overestimate actual life expectancy. For example, the child- 
hood and adult obesity epidemics have indicated rapid and dramatic changes 
in health conditions among recent birth cohorts that may be eroding hard- 
won gains in health and longevity, as predicted by the three-dimensional 
model (Reither, Olshansky, and Yang 2011). 


6.3.2 Forecasting of the U.S. Cancer Mortality Trends 
for Leading Causes of Death 


Previous forecasters have generally relied on simple extrapolative methods 
applied directly to age-specific mortality rates. We show how the extrapo- 
lation of coefficients from APC models fit to these rates can produce sig- 
nificant gains in forecasting accuracy. We chose mortality data on several 
leading cancer sites that showed strong cohort effects, including lung and 
colorectal cancer mortality for men and women, female breast cancer, and 
male prostate cancer mortality for whites and blacks. The example of APC 
models using the IE as the method of identification has provided a summary 
of the observed rates and attributed changes to both time period and birth 
cohort in addition to age. The model fit statistics Oe, the BICs) showed that 
the full models accounting for the C effects are the models of choice. The 
strong presence of cohort effects, therefore, is likely to affect the predicted 
mortality rates and produce different future trends compared to methods 
that ignore them. 


6.3.2.1 Methods of Extrapolation 


Two approaches conventionally implemented in demographic projection and 
forecasting are simple linear extrapolation and time series methods. The sim- 
ple linear extrapolation employs linear regression models to extrapolate the 
mortality rates in the base period to the future. It has been used as the bench- 
mark for comparison in the previous study of the APC model-based forecast- 
ing by Osmond (1985). An alternative is to use the autoregressive integrated 
moving average (ARIMA) model for forecasting time series trends (Box, 
Jenkins, and Reinsel 1994). The simple linear extrapolation focuses on the 
linear trends and imposes a fixed rate of change based on past data, whereas 
the ARIMA model may apply to phenomena experiencing variable rates of 
change by incorporating local stochastic trends or universal fixed trends, 
thus providing flexibility in accounting for most variations (McNown and 
Rogers 1989). The ARIMA approach has been the basis for many recent fore- 
casting studies using the time series method (Lee and Carter 1992; McNown 
and Rogers 1989; Shibuya, Inoue, and Lopez 2005). Instead of directly pro- 
jecting age-specific mortality rates into the future, we apply these methods 
to projections of APC model coefficients to obtain fitted values of mortality 
rates. Because the APC models provide the best summary and character- 
ization of the forces underlying the mortality trend data, coefficient-based 
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extrapolation should be superior to the conventional methods of data extrap- 
olation, which fail to take into account birth cohort variation. 

To test this proposition, we compared the performance of two methods: 
the non-APC model-based, or “simple,” method and the APC model-based 
method. For the simple projection, we fit, for each age group, (1) a linear 
regression model of mortality rates as a function of period year and (2) an 
ARIMA model of the time series of mortality rates and obtained predicted 
mortality rates from these models for the projection period. For the APC 
model coefficient-based projection, we first fit a linear regression model and 
an ARIMA model to the set of period effect coefficients and cohort effect 
coefficients estimated by the IE method, assuming the age effects to be con- 
stant over the projection period. We then combined the age effect coefficients 
and predicted period and cohort effect coefficients from the previous step 
to obtain the predicted mortality rates. In the application of the linear coef- 
ficient extrapolation method, the number of coefficients used or the weight 
applied to each of these coefficients would affect the prediction outcomes 
(Osmond 1985). So, we compared different weighted linear extrapolations 
of period and cohort coefficients. The numbers of period coefficients in 
use ranged from all to the last two periods. We examined three options of 
weighting cohort coefficients, including all, the last 10 cohorts, and the last 
5 cohorts. For the time series extrapolation, we estimated and identified the 
ARIMA models that yielded the best fit for both period and cohort coeffi- 
cients according to the method of Box, Jenkins, and Reinsel (1994). 

When using linear extrapolations, the particular choice of constraints to 
identify the model coefficients does not affect the fitted values because all 
just-identified APC models have an identical fit to the data. That is, the use of 
the CGLIM (constrained generalized linear model) or IE affects only the esti- 
mates of individual effect coefficients of the A, P, and C variables, but not the 
predicted mortality rates. When using nonlinear extrapolations such as the 
ARIMA, the predictions using CGLIM and IE coefficients will be somewhat 
different, however. We employed the IE coefficients throughout the forecast- 
ing analyses given previous theoretical findings on the IE's desirable statisti- 
cal properties. 


6.3.2.2 Prediction Intervals 


The uncertainty associated with both the model coefficients and their extrap- 
olation will lead to uncertainty in the point forecasting of future mortality 
rates. It is, therefore, important to quantify this uncertainty through predic- 
tion intervals (PIs) or interval forecasting. We derived the PIs using the boot- 
strap method. Readers who are not interested in the algebraic details can 
skip this section and directly apply the codes we provide online. We adopt 
the scheme of resampling residuals instead of the case resampling scheme 
usually implemented in the bootstrap method (Wu 1986). Because the age, 
period, and cohort components of our model have a special dependency 
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structure, the use of a case resampling scheme would miss some specific 
cases in each run and make it impossible to estimate certain periods or 
cohorts efficiently. The residual resampling scheme, on the other hand, has 
the advantage of maintaining the structure of the regressors and thus is 
more appropriate in the context of APC models. The algorithm of the boot- 
strap method is included in Appendix 6.1 for reference. Note that the PIs 
generated by the bootstrap procedure are more conservative (wider) than 
model-based ones. Therefore, although the IE produces statistically more 
efficient point estimates with smaller standard errors than other estimators, 
the PIs for predictions using different APC model estimators derived from 
the bootstrap method are asymptotically similar. 


6.3.2.3 Internal Validation 


To examine how well simple and APC model-based methods perform in fore- 
casting, we first conducted an internal validation analysis. That is, we with- 
held mortality rates of the most recent two periods as the actual or observed 
rates, applied the simple and APC model-based extrapolations to the mortal- 
ity rates in the base period and then compared the predictions generated 
from these two methods with the observed rates in the projection period. In 
addition, this analysis also allowed us to assess how sensitive the predicted 
values were to the choice of assumptions in coefficient extrapolation. The 
results of the comparison are summarized in Table 6.3 for the case of male 
lung cancer mortality. 

For ages 30 to 85+ in the eight periods from 1969-1974 to 2005-2007, the 
observed death rates are listed in the first line. We fit the log-linear APC 
models to the observed death rates in the base period of 1969 to 1999. The 
fitted values are shown in the second line. We then extended period effect 
coefficients to the next two prediction periods of 2000-2004 and 2005- 
2007 and cohort effect coefficients to the 1965-1969 and 1970-1974 cohorts. 
The coefficients used and results of linear and ARIMA extrapolations of these 
coefficients are shown in the left three panels in Table 6.4. The ARIMA algo- 
rithm automatically chooses the best model that fits the data (IE coefficients). 
For the linear coefficient extrapolation, we found that different numbers of 
cohort coefficients used only affected the first few age groups (e.g., 30-34 and 
35-39) in short-term forecasting and produced similar results even within 
these age groups, which is consistent with the previous study by Osmond 
(1985) and the IE coefficient estimates of cohort effects that showed rela- 
tively linear trends (Figure 6.3). Thus, we present results that used the last 10 
cohort coefficients. Different weighted linear extrapolations of period coef- 
ficients made considerable differences in predicting mortality rates, with 
the best predictions generated from weights applied to the last few periods. 
In the case of lung cancer mortality internal forecasting, the best predicted 
values were from extrapolations using the last three period coefficients for 
males and the last two for females. The analysis of other cancer sites showed 
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TABLE 6.4 
APC Model Coefficients and Projections for Lung Cancer in Males in the U.S. 


Internal Validation Forecasting 


Coefficient linear ARIMA Coefficient linear ARIMA 


Period 

1969-1974 —0.34 —0.27 

1975-1979 -0.17 —0.15 

1980-1984 —0.03 —0.06 

1985-1989 0.09 0.03 

1990-1994 0.20 0.08 

1995-1999 0.25 0.09 

2000-2004 0.34 0.31 0.11 

2005-2007 0.42 0.36 0.17 

2010-2014 0.22 0.22 
2015-2019 0.28 0.28 
2020-2024 0.33 0.33 
2025-2029 0.39 0.39 
Cohort 

-1885 0.46 0.26 

1890 0.60 0.43 

1895 0.71 0.58 

1900 0.72 0.64 

1905 0.71 0.68 

1910 0.66 0.67 

1915 0.57 0.62 

1920 0.46 0.56 

1925 0.37 0.52 

1930 0.23 0.42 

1935 0.02 0.26 

1940 —0.24 0.07 

1945 —0.54 —0.16 

1950 —0.87 —0.41 

1955 —1.05 —0.55 

1960 —1.31 -0.72 

1965 —1.48 —1.01 

1970 -1.73 —1.65 -1.37 

1975 -1.96 —1.82 —1.50 

1980 —1.70 -1.62 
1985 -1.91 -1.75 
1990 -2.13 -1.87 


1995 -2.35 -1.99 
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The IE coefficient estimates of lung cancer mortality by sex: cohort and period effects. 


similar findings. The best predictions were from weighted LMs fit to the 
last three period coefficients for white female breast cancer mortality and 
the last two coefficients for all the others (male and female colorectal cancer, 
white and black prostate cancer, and black breast cancer). These findings are 
consistent with previous cancer-forecasting studies (Bray and Meller 2006; 
Moller et al. 2003) that suggested that rapid changes in cancer-screening 
techniques and treatments have occurred in more recent years, reducing the 
relevance of data from the more distant past for forecasting. In this case, it is 
more reasonable to allow greater weight to be given to more recent data. In 
addition, the IE estimates of period effects based on all periods of data illus- 
trated in Figure 6.3 further suggest empirically the advantage of using the 
last several coefficients. The period coefficients for female lung cancer mor- 
tality had one inflection point at year 1990. Therefore, including only points 
at years 1990 and 1995 yielded a better prediction of the coefficients at years 
2000 and 2005. For males, there were multiple inflection points, and includ- 
ing only the last two coefficients would lead to underestimates of the values 
for the years 2000 and 2005, but adding one previous coefficient at year 1985 
would reduce the bias. 

The predicted male lung cancer mortality rates using the linear and 
ARIMA extrapolated coefficients from the APC models are then included 
in Table 6.3 for comparison with the simple extrapolations of mortality 
rates. For each age group, the best average predictions in the two periods 
(2000-2004 and 2005-2007) are highlighted in bold. With the exception 
of the first two age groups, the APC model-based projections performed 
much better than the simple extrapolations of age-specific mortality rates. 
This can be more clearly seen in Figure 6.4, which plots the age patterns 
of observed and predicted mortality rates for the two projection peri- 
ods together with the 9576 PIs. For simple extrapolation, only the linear 
extrapolation results are shown in the interest of space as the results 
hold for the ARIMA extrapolations. The figure shows that the predicted 
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Age-specific lung cancer mortality rate predicted by different methods with 95% PIs: 2000-2004 
and 2005-2007. 


mortality rates from the simple linear extrapolation were far higher than 
the actual mortality rates for both periods, and this difference increased 
with age. In contrast, the two APC model-based projections yielded much 
better predictions. The comparison of the interval predictions of the last 
two methods showed that the linear extrapolation produced narrower PIs 
than the ARIMA method. This is because the ARIMA models automati- 
cally incorporated the stochastic process, allowing for appropriate ran- 
dom structure of the prediction. The ARIMA method thus provided better 
coverage of the true values. For example, the point predictions from both 
methods overestimated female death rates at age 85+, but the ARIMA PIs 
covered the true values, whereas the linear PIs did not. Similar results of 
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internal validation were found for the other cancer sites. While the linear 
extrapolation method yielded better point forecasts, the ARIMA method 
produced wider PIs that covered the true values when the point forecasts 
were biased. 

Figure 6.5 further compares the observed and projected lung cancer mor- 
tality rates by time period for selected age groups between 40-44 and 70-74. 
Figure 6.6 shows the corresponding results for other cancer sites. Both sug- 
gest that the APC model-based projections were vastly superior to the simple 
extrapolation projections. This was especially pronounced when the mortal- 
ity trends changed directions in certain periods due to cohort changes. The 
simple linear projections extended previous period trends but ignored cohort 
changes. For example, the lung cancer and white female breast cancer mor- 
tality rates were predicted to increase in the projection period (2000—2007) at 
older ages by the simple linear extrapolation method, whereas the observed 
rates declined. While the female colorectal cancer mortality rates were pre- 
dicted to decrease in the projection period at ages 40—44 by the simple linear 
extrapolation method, the observed rates slighted increased. The simple lin- 
ear projections deviated from observed mortality rates by large measures for 
the other cases. In contrast, the APC model-based projections were able to 
capture actual mortality rates much better by accounting for cohort changes 
that partly generated the observed trends. 


6.3.2.4 Forecasting Results 


Based on the results of the foregoing internal validation, we forecast the 
mortality rates of major cancer sites into the next 20 years from 2010 to 2029. 
Using the full set of observed mortality data from 1969 to 2007, we first 
estimated the APC models using the IE to obtain the period and cohort 
coefficients and fit linear regression models and ARIMA models to these 
coefficients. The results of the lung cancer mortality forecasting by sex 
are shown in the right three panels of Table 6.4. For the linear coefficient 
extrapolations, we used the last two period coefficients and the last 10 cohort 
coefficients as the base for projection based on conclusions from previous 
research and the internal validation findings. The results obtained from this 
approach that emphasized most recent period influences can then be com- 
pared with those from the ARIMA approach that utilized information from 
all previous periods and cohorts. 

Table 6.5 presents the point forecasting results for the lung cancer mor- 
tality rates using the two APC model-based methods. The forecasted age- 
specific rates with 9576 PIs are then displayed in Figure 6.7a. The male lung 
cancer mortality rates were predicted to continue to decline for all ages, with 
the rates of decline slowing down in 2025-2029 for ages 50-54 and 70-74. 
The female lung cancer death rates were predicted to continue to increase 
for the next 10 years for ages 80-84 and decline afterward. The lags between 
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Colorectal, prostate, and breast cancer mortality rate predicted by different methods with 95% 
PIs by period for selected age groups. 


the male and female declines reflect the sex difference in cigarette smoking 
for cohorts born in the early twentieth century widely observed in previous 
studies and suggested by the IE estimates of cohort trends in lung cancer mor- 
tality (Figure 6.3). Figures 6.7b—6.7d display the forecasted age-specific rates 
with the 95% PIs for colorectal, prostate, and breast cancer mortality, respec- 
tively. Overall, point forecasts showed mortality declines for all cancer sites 
across most ages. Since the period effects were mostly projected to increase 
or remain constant, this was largely due to the cohort effects that showed 
decreases in cohort mortality. The point forecasts from the two projection 
methods were similar for lung mortality. For the other three cancer sites, the 
two methods demonstrated larger differences. The differences were particu- 
larly large for blacks, with the ARIMA forecasts predicting less declines in 
prostate cancer mortality and evident increases in breast cancer mortality. 
With the exception of male colorectal cancer mortality, the predicted val- 
ues from the ARIMA models were consistently higher than those from the 
LMs. The ARIMA models and the LMs yielded much similar results of lung 
cancer mortality rates because the period effect coefficients had a stronger 
linear trend (Figure 6.3). In contrast, the period effect coefficients showed 
considerable slowdowns in mortality increases for prostate and breast cancer 
mortality in more recent periods. While the LMs fit to the last two periods 
extended such slowdowns into the future, the ARIMA models incorporated 
earlier periods of mortality increases and resulted in extensions of higher 
period coefficients, which in turn translated into less-optimistic prospects 
of predicted rates. 
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The interval forecasting results showed increases in widths of the Pls with 
age and time period. The Pls of the forecasts for lung cancer mortality were 
considerably tighter for males than females, whereas the Pls of the forecasts 
for colorectal cancer mortality were narrower for females than males. The 
Pls for the forecasts of prostate and breast cancer mortality were substan- 
tially larger for blacks than whites, particularly at older ages, indicating that 
the mortality data were less robust for blacks than for whites at these ages. 

In sum, we have demonstrated the utility of the APC models for forecast- 
ing using cancer mortality as examples. The APC model-based forecasting 
methods showed substantial advantages over the simple data extrapolation 
methods in accuracy of point forecasts and assessments of uncertainty in 
these forecasts through Pls. Like any forecasting studies, the APC model- 
based forecasting should rest on reasonable assumptions of how future rates 
relate to the past. Extrapolations of model coefficients yielded results highly 
sensitive to the choice of assumptions and must be evaluated empirically. 
We have shown how to achieve a range of predictions based on alternative 
assumptions. The ARIMA approach produced wider PIs and thus may be 
more appropriate for conservative forecasting if no other information exists 
that indicates stronger influences of more recent periods on the future. In 
the case of cancer, changes in period conditions can occur frequently. So, 
weighted linear regression models based on the most recent periods can be 
highly effective. The APC model-based forecasting methods introduced here 
concern the extrapolations of model coefficients of period and cohort effects. 
Recent studies of cancer incidence also suggested the use of health-based 
forecasting that incorporates health-related risk factors such as smoking rates 
and treatment effects (Manton, Akushevich, and Kravchenko 2009). How to 
conjoin these two modeling approaches to further improve the accuracy of 
forecasting of cancer incidence and mortality rates remains a challenge for 
future development of forecasting methodology. 


Appendix 6.1: The Bootstrap Method Using a Residual 
Resampling Scheme for Prediction Intervals 


1. Fit the APC model and retain the fitted link values ñ; and the resid- 
ual ê; = y; - lt; = log(y;) - D, i = 1, 2, ..., n, where n indicates the number 
of observations or the number of age groups a times the number of 
period groups p. 

2. For each pair x; ji in which x; is the explanatory variable, add a ran- 
domly resampled residual 8; to the link variable u,. In other words, 
create the synthetic link and response variables It; = Ñ, + êj and y; = 
exp(u;) where j is selected randomly from the list (1, ..., n) for every i. 
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3. Refit the model using the fictitious response variables y;, make fore- 
casting based on the synthetic jj; as from the prediction procedure, 
and retain the predicted y;. 

4. Repeat Steps 2 and 3 a large number of times (10,000 times here) to 


obtain (10,000) samples of y; and derive the mean, standard errors, 
and PIs. 
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Mixed Effects Models: Hierarchical APC- 
Cross-Classified Random Effects Models 
(HAPC-CCREM,, Part I: The Basics 


7.1 Introduction 


In Chapter 5, we discussed the advantages and disadvantages of various 
approaches to identification and estimation of the A, P, and C effect coef- 
ficients for the conventional APC accounting/multiple classification model. 
In this context, the Intrinsic Estimator method has a number of desirable sta- 
tistical properties and often yields good estimates of the effect coefficients. It 
is important to point out, however, that the classical APC identification prob- 
lem is not inevitable. It is largely a function of the application of conventional 
linear or generalized linear models to APC data, often but not exclusively in 
the form of the first research design (tabulated tables of population rates or 
proportions). These models not only are certain to produce the identifica- 
tion problem but also may be a poor approximation to the process of social 
change that a researcher seeks to model. We now discuss major limitations 
of linear models and introduce alternative model specifications through the 
mixed effects models increasingly employed in demography, epidemiology, 
and the social sciences to address these limitations. 

We then define and discuss the properties of a hierarchical age-period- 
cohort (HAPC) modeling framework. An HAPC framework does not incur 
the identification problem because the three effects are not assumed to be lin- 
ear and additive at the same level of analysis. In addition, HAPC models can 
capture the contextual effects of periods and cohorts and thus stimulate new 
conceptualizations of processes of change. This chapter focuses on the expo- 
sition of the basic cross-classified random effects modeling (CCREM) speci- 
fications of HAPC models through algebraic analysis and empirical analyses 
of examples of the second research design (repeated cross-sectional sample 
surveys), including the General Social Survey (GSS) and National Health and 
Nutrition Examination Survey (NHANES) described in Chapter 3, to show 
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how HAPC models for such designs avoid the identification problem and 
offer more opportunities for testing explanatory hypotheses. 


7.2 Beyond the Identification Problem 


In many previous applications of linear models to age-period-cohort (APC) 
analysis, the identification problem could not be avoided. First, because most 
of these applications were focused on aggregate population data using the 
first research design, the three variables relate to one another in an exact 
linear fashion. For example, for each age and calendar year, there is a unique 
birth cohort. This linear dependency, however, does not constitute a prob- 
lem until certain assumption is made about how the data simultaneously 
affect an outcome of interest. The linear models fit to the APC data in this 
form, namely, APC accounting models, induce this problem by imposing 
the assumption of additivity and treating the three temporal variables as 
fixed effects that are independent of each other. Since the formalization 
of accounting /multiple classification models for APC analysis in the early 
1970s (Mason et al. 1973), the voluminous literature in social sciences and 
biostatistics has focused on the resulting underidentification “conundrum” 
as a methodological problem. However, the problem is also a theoretical one. 
That is, a linear APC model may not accurately describe age-, period-, and 
cohort-related phenomena. It should be noted that additivity of the effects 
of these three temporal dimensions is only one simple approximation to the 
process generating time-related changes. Some cohort or period phenomena 
involve problems that can hardly be handled by any version of APC account- 
ing models (Hobcraft, Menken, and Preston 1982; Smith 2004). So, there is a 
need to develop new models that accommodate both additive and nonaddi- 
tive processes. 

Our point of departure then is to develop such models that present a more 
thorough solution to the identification problem (by not incurring it); better 
characterize the cohort and period effects; stimulate new conceptualizations 
of processes of social, demographic, and epidemiologic changes; and are 
capable of addressing new theoretical questions about these changes. We 
begin with the observation in Chapter 3 that the repeated cross-sectional 
design is in fact a multilevel design in which individual-level observations 
are nested in, and cross-classified simultaneously by, the two higher-level 
social contexts defined by time period and birth cohort.’ Applications of lin- 
ear models to such a design based on the assumption of fixed period and 


` Parts of Sections 72-75 are adapted and updated from Yang, Y., and K. C. Land. 2006. 
Sociological Methodology 36:75-97 and Yang, Y. and K. C. Land. 2008. Sociological Methods & 
Research 36:297—326. 
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cohort effects ignore the multilevel structure of the data design and may not 
be adequate substantively or statistically. 

We have noted before that when data become available to allow age inter- 
vals to differ from period intervals, such as the case of sample surveys in 
the second research design, one can utilize the unequal intervals to group 
one or more of the three variables to break their exact linear dependency. As 
explained in Chapter 4, this solution is not capable of resolving the identifica- 
tion problem completely when used in the conventional linear models in that 
the results may be sensitive to specific interval lengths and hence degrees 
of overidentification. We further note here that such differential groupings 
are also problematic for statistical inference. To illustrate this, consider the 
application of the fixed effects APC regression model of Equation (4.5) to, say, 
the following five sample members, ages 31, 32, 33, 34, and 35 at last birthday, 
each of whom is a member of the same 5-year birth cohort, the 1955—59 birth 
cohort, and each of whom is a survey respondent in the 1990 GSS: 


Y. 19901955-59 = H + 0(31) + £ 1990195550 (71) 
Y»19901955-59 = H + 0(32) + £ 1990195550 (7.2) 
Y3,1990,1955-59 = H + 0(33) + €3,1990,1955-59 (7.3) 
Y4,1990,1955-59 = H + 0(34) + €4,1990,1955-59 (74) 
Y5,1990,1955-59 = H + 0(35) + €5,1990,1955-59 (7.5) 


where the five individual sample respondents are numbered from 1 through 5, 
respectively, their respective ages (31 through 35) have been entered into the 
age term of the model whose coefficient is a. To complete the APC model 
specification of Equation (4.5) in Chapter 4, we have the following specifica- 
tion on the error terms: 


E;,1990,1960-64 = P1990 + Yisss-s9 + €i19901955-5» for í = 1, 2, ...,5 (7.6) 


The error terms specifically include fixed effect coefficients to measure the 
impact of the time period (Bi) and the birth cohort Dos so) to which these 
sample respondents belong. 

The fixed effects model of Equations (7.1)- (7.6) could be estimated straight- 
forwardly by using dummy variables to control for the period and cohort 
effects in a conventional multiple linear regression model using the differ- 
ential grouping as the identifying constraint (single-year-of-age and 5-year 
cohorts). The key assumption used here is that impacts of cohort and period 
(survey year) on the responses of sample members are adequately modeled 
as fixed. This ignores the possibility that the effects of cohort membership 
and survey year may have shared random as well as, or instead of, fixed 
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effects on the responses; that is, that the observations are correlated. That is, 
sample respondents in the same cohort group or survey year may be similar 
in their responses due to the fact that they share unmeasured contextual 
random error components (i.e., through random cohort or period compo- 
nents of e;1990,1955-59) unique to their cohorts or periods of the survey. Note 
that the sharing of common elements in the error terms may result in such 
weak covariation among responses that there are no serious complications 
for estimates of APC coefficients for standard regression models estimated 
by ordinary least squares (OLS). But, a failure to assess this potentially more 
complicated error structure adequately in APC analysis may have serious 
consequences for statistical inferences. The standard errors of estimated 
coefficients of fixed effects regression models like Equations (7.1)-(76) may 
be underestimated, leading to inflated f ratios and actual alpha levels that 
are larger than the nominal .05 or .01 levels (Hox and Kreft 1994). 

This heterogeneity problem can be addressed by modifying the fixed 
effects specification toward a random effects or random coefficients regres- 
sion model. That is, to take into account the possibility that the common 
period and cohort elements of the error terms of Equations (7.1)-(7.6) are sta- 
tistically significant, we should allow for the possibility that at least some of 
the effect coefficients u, 0, Been, and Yioss-s9 in those equations are not fixed 
but instead vary randomly by cohort or time period. This implies that we 
should modify the fixed effects linear APC model (4.5) in Chapter 4 to a 
mixed effects model. 

The mixed (fixed and random) effects model is thus a reasonable alternative 
to the linear model because it does not assume fixed age, period, and cohort 
effects that are additive and therefore avoids the identification problem. 
More important, it can statistically characterize contextual effects of histori- 
cal time and cohort membership and reveal the process by which individu- 
als' lives are shaped by their environment. Prior studies utilizing multilevel 
mixed effects models in the analysis of temporal change are few but exist in 
demographic research on health (Lynch 2003) and developmental psychol- 
ogy and aging research on cognitive skills (e.g., Alwin 2009). These examples 
did not explicitly embody a full-blown APC analysis and focused instead on 
age patterns in the context of specific cohorts only. The breakthrough of the 
approach to cohort analysis developed initially by Yang and Land (2006) and 
described in the following is the simultaneous modeling of all three factors 
using multilevel data and mixed effects or hierarchical models. 

In addition to its ability to account for the multilevel heterogeneity in the 
data, the mixed effects modeling approach has a second important advan- 
tage of being able to explain random variability by the incorporation of addi- 
tional covariates at both the individual level (especially for microsurvey data 
design) and the higher level. That is, through the specification and model- 
ing of individual attributes and contextual characteristics of time period and 
birth cohort, this approach allows for the test of explanatory hypotheses of 
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A, P, and C effects not possible in the APC accounting model framework. 
It also improves on the proxy variable approach mentioned in Chapter 4. 
That is, whereas the proxy variable approach equalizes period or cohort 
effects with period- or cohort-specific variables and thus is vulnerable to 
problems of misspecification, the mixed effects model approach accounts 
for the uncertainty of corresponding effects through explicit error terms 
and provides means to ascertain how much of the period or cohort variance 
can actually be explained by measured period- or cohort-level characteris- 
tics. All these features of the mixed effects modeling framework facilitate 
mechanism-driven analyses that move far beyond the conventional APC 
analysis of trend identification. 


7.3 Basic Model Specification 


The simplest form of model specification of the HAPC model as introduced 
by Yang and Land (2006) is a member of the class of linear mixed models 
(LMMs): This model specification is the most widely used form of hierarchi- 
cal or multilevel linear models as presented in such standard expositions 
as those of Raudenbush and Bryk (2002) and Snijders and Bosker (1999). 
It consists of a two-component model: The level 1 component is a regres- 
sion of an individual-level outcome variable on a set of individual-level 
explanatory variables (regressors, covariates) with an intercept term, fixed 
regression slope coefficients, and an individual-level random error term. 
Level 2 models use level 1 regression coefficients as outcomes and contain 
intercepts and specification of random effect coefficients for the effects of 
each cohort and time period distinguished in the model. The level 2 model 
may also contain cohort or time period explanatory variables with fixed 
effect coefficients that are hypothesized to explain, at least in part, the cohort 
or period effects (Yang 2006), which are illustrated in the next chapter. 

To describe the HAPC model, we focus for now on the example of trends 
of verbal ability in the GSS data. The previous findings on trends in verbal 
scores are interesting and suggestive. But until age, period, and cohort effects 
are simultaneously estimated, the question of whether the trends are due to 
any of these components remains unresolved. In recognition of the multi- 
level characteristics of the GSS data structure shown in Chapter 3 (Table 3.5), 
we formulate a CCREM specification of the HAPC model to assess the 


* General matrix algebraic representations of LMMs and generalized linear mixed models 
(GLMMs) are given in Appendix 71. The exposition in the chapter text focuses on scalar 
representations accessible to readers who do not have knowledge of matrix algebra. 
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relative importance of the two contexts, cohort and period, in understanding 
individual differences in verbal test outcome. 

Because the WORDSUM outcome variable has a relatively bell-shaped 
sample frequency distribution, it is reasonable to use an HAPC mixed model 
specification that has a conventional normal errors level 1 fixed effects regres- 
sion model. In the absence of evidence to the contrary, this level 1 model can 
be combined with a conventional normal period and cohort residual random 
effects specification at level 2. In such an LMM model applied to the verbal 
test data, variability in WORDSUM associated with individuals, cohorts, and 
periods is specified as follows: 


Level 1 or ^within-cell" model: 
WORDSUM jx = Ba, + Bi AGE; + B» AGE} + BSEDUCATION jx 
+ B4SEX jx + B5RACE jx + eix (77) 


with ou ~ N(0,0?) 


Level 2 or “between-cell” model: 


Boi = Yo + Uoj + Vok, with Uoj ~ N(0, T4) , Vok ~ N(0,,) (78) 


Combined model: 


WORDSUM  — Yo + B1 AGE jx  B3AGE;,  B5EDUCATION A, 
(79) 
+ B4SEX jg, +BsRACE;k + Uoj + Vok + E ij 


for 
i= 1,2, ..., ny individuals within cohort j and period k; 
j= 1, ..., 20 birth cohorts; 
k =1,..., 17 survey years; 


where, within each birth cohort j and survey year k, respondent (o verbal 
score is modeled as a function of his or her age, age-squared, educational 
attainment, and two covariates, gender and race, that have been found in 
previous research to be related to verbal ability (see, e.g, Hedges and Nowell 
1995; Campbell, Hombo, and Mazzeo 2000). This random intercepts model spec- 
ification allows only the level 1 intercept to vary randomly from cohort to 
cohort and period to period, but not the level 1 slopes. One can also specify 
the cross-classified random coefficients model specification of the HAPC model 
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(see Section 76) wherein level 1 slope coefficients also have random varia- 
tions across cohorts and periods. Because supplemental analyses of the ver- 
bal test score data did not show significant random cohort or period effects of 
any level 1 slope coefficients, we estimated only the random intercepts model. 
We illustrate this more sophisticated specification in other examples. 

In model (77)-(79), Boz is the intercept or "cell mean,” that is, the mean 
verbal test score of individuals who belong to birth cohort j and surveyed 
in year k; B, ..., Bs are the level 1 fixed effects; e;, is the random individual 
effect, that is, the deviation of individual ijk's score from the cell mean, which 
is assumed normally distributed with mean 0 and a within-cell variance 07; 
Yo is the model intercept or grand-mean verbal test score of all individuals; 
ug; is the cohort effect or residual random effect of cohort j, that is, the contri- 
bution of cohort j averaged over all periods, on Bu, assumed normally dis- 
tributed with mean 0 and variance v,; and v, is the period effect or residual 
random effect of period k, that is, the contribution of period k averaged over 
all cohorts, assumed normally distributed with mean 0 and variance T,. In 
addition, Bo; = Yo + Uo; is the cohort verbal test score random effect averaged 
over all periods; and By, = Yo + v; is the period verbal test score random effect 
averaged over all cohorts. 

The HAPC-CCREM of Equations (77)-(79) is defined by its statistical 
parameters: the regression parameters Yo, By D, Bs, By, Bs and the variance 
components, 6?, T,, and t,. Two major approaches to the estimation of these 
parameters (Longford 1993; Jiang 2007) are maximum likelihood (ML) and 
restricted maximum likelihood (REML). The two methods differ little with 
respect to estimation of the regression coefficients. For estimation of the vari- 
ance components, however, the REML method takes into account the loss of 
degrees of freedom resulting from the estimation of the regression param- 
eters, whereas the ML method does not. The consequence is that the ML esti- 
mators for the variance components have a downward bias, and the REML 
estimators do not. For mixed models with relatively small numbers, say less 
than 30, of groups, this can be important. Since the numbers of time periods 
and cohorts in HAPC models almost always will be less than 30, REML is the 
preferred estimator for HAPC models. 

This model specification can be used to highlight an important differ- 
ence between the fixed effects coefficients of the level 1 model (77) and the 
random effects of the level 2 model (78). In brief, the individual-level fixed 
effects regression coefficients are parameters to be estimated in a conventional 
statistical sense. As Snijders and Bosker (1999: 58) emphasized, by contrast 
the random group (time period and cohort) effects are latent variables (i.e., not 
directly observable) rather than statistical parameters and accordingly are 
not an integral part of the statistical parameter estimation (see Appendix 71). 
In statistical terminology, the random effects are “predicted” rather than esti- 
mated, as the term estimation is reserved for finding likely values of statistical 
parameters. The method used to predict the random effects is empirical Bayes 
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(EB), which produces posterior means (Etron and Morris 1975; Jiang 2007: 86). 
For a specific group in model (77)-(79), say a specific cohort, the EB estimator 
predicts its random effect as a weighted average of the adjusted grand mean 
or population average of the WORDSUM outcome variable across the pooled 
sample of all observations and the estimated average impact of that cohort 
on the expected value of the outcome variable across all time periods. The 
weighted average takes the form of a convex combination of these two esti- 
mates, where the weights are a function of the estimates of the variances of 
the cohort effects and the errors of the level 1 model. The resulting predicted 
random effect shrinks the cohort-specific estimated effect toward the adjusted grand 
mean, which is considered to be more reliably estimated (because it is based 
on a larger sample of observations) than the average impact of a specific 
cohort. EB estimators thus are biased toward the population averages but 
have a smaller mean-squared error for a randomly drawn group or cohort. 
When used in combination with the REML method for estimation of the 
HAPC model parameters, the result is a REML-EB algorithm, which is the 
method programmed into most conventional software programs for mixed 
model estimation. 

An important decision in mixed effects/hierarchical linear model analy- 
ses pertains to "centering" or choosing the location of the individual-level 
explanatory variables (Raudenbush and Bryk 2002). The main choices are (a) 
using the natural metric (NM) of the variables, (b) grand mean centering (GMC) 
by subtracting the complete sample or grand mean from the observed val- 
ues; and (c) centering within subgroups or contexts (CWC) studied by subtract- 
ing subgroup means from observed values. For hierarchical models in which 
only the intercept but not the slopes is random at level 1, as is the case for the 
models mentioned, Snijders and Bosker (1999: 81) showed that all three of the 
NM, GMC, and CWC approaches led to models that are statistically equiva- 
lent in terms of the parameterizations of the combined models. In fact, we 
found empirically in our analyses of the WORDSUM data that there is not 
a great deal of difference among estimated coefficients under the three dif- 
ferent approaches (although there are some variations in terms of variance 
decompositions and fit statistics). Thus, in the absence of methodological 
guidelines that privilege one of the three alternatives, substantive-theoretical 
reasoning guided the choice of centering. Because the minimum value of the 
key explanatory variable of age does not include zero (since the GSS sample 
frame is for ages 18 and over) in the model of Equation (79), one of the other 
options should be used. Furthermore, the literature on the effects of age on 
vocabulary knowledge (Wilson and Gove 1999b: 257—258) cites a pure physi- 
ological age effect that does not vary by cohort context. Therefore, we applied 
centering on the grand mean to the individual-level age variable in Equation 
(79). In the case of education, by contrast, Wilson and Gove (1999b: 255-256) 
argued that changing average levels of school years completed varies very 
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substantially across the cohorts surveyed in the GSS. To take this changing 
cohort context of education into account, we therefore centered education on 
the cohort means. 


7.4 Fixed versus Random Effects HAPC Specifications 


An alternative model specification for HAPC analysis of repeated cross- 
sectionalsurvey data would specify thetime periodand cohort effects as fixed 
rather than random. Under what conditions should an analyst use a fixed 
rather than a random effects specification? The literature on hierarchical/ 
multilevel regression models contains some general guidelines (not specific 
to HAPC models) on when certain effect coefficients should be treated as 
fixed or random. To articulate these guidelines, consider a simple hierarchical/ 
multilevel linear model (HLM) that has a level 1 or individual-level model: 


Yj = Bo Bixj +e; (7.10) 


where Y; is a response variable for individual i in group j, x; is an explanatory 
variable or regressor for individual i in group j, By is the intercept parameter 
for the regression model, B, is the slope parameter of the regression, and e; 
is a random error term. Suppose that the intercept B, is group dependent, 
and that it varies randomly among J observed groups. To model this random 
variation, we specify the level 2 or group-level model: 


Bo = Yoo + rj (711) 


Thislevel 2 model separates the group-dependent intercept into an average inter- 
cept among the groups Yoo plus a group-level deviation or error ry. Substitution 
of Equation (711) into Equation (710) then yields the combined model: 


Y; = Yoo + Bixy +10) + ei (712) 


The values of the ry are the main effects of the groups: Conditional on 
having a specific X-value and being in group j, the expected Y-value for indi- 
vidual i deviates by ry from the average expected value for all individuals 
over all groups. Note again that this is the simplest possible formulation of a 
hierarchical model; a more general formulation would allow for the possibil- 
ity that the slope coefficient in Equation (710) could vary among the groups, 
and there could be more than one explanatory variable in the level 1 model. 
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As a statistical model, Equations (7.10)-(7.12) can be treated in two ways: 


1. As a fixed effects model, wherein the ro; are treated as fixed param- 
eters, J in number. This approach leads to a specific instance of a 
fixed effects regression model, namely, the conventional analysis-of- 
covariance model, in which the grouping variable is a covariate. 


2. As a random intercepts model, wherein the roj are assumed to be inde- 
pendent identically distributed random variables (more generally, if 
the slope coefficient D, is specified as interacting with the level 2 ran- 
dom effects, then the model is a random coefficients model). These errors 
now are assumed to be randomly drawn from a population with zero 
mean and an a priori unknown variance. This assumption is equiva- 
lent to the specification that the group effects are governed by mecha- 
nisms or processes that are roughly similar from group to group and 
operate independently among the groups. This is termed the exchange- 
ability assumption. The random coefficients model also requires the 
assumption that the random level 2 or contextual effects Ge, the ry 
coefficients) are distributed independently of the level 1 regressors. 


These two approaches to the model of Equations (710)-(712) imply that 
hierarchical data generally can be analyzed in two different ways using 
models with fixed or random group-level coefficients. Which of these two 
specifications is the most appropriate in a given situation depends on a num- 
ber of considerations. 

Goldstein (2003: 3-4) and Snijders and Bosker (1999: 43-44) provided sum- 
maries of conventional statistical wisdom and methodological guidelines for 
choosing between the fixed or random specifications. They pointed out that 


1. If the groups are regarded as unique entities and the objective of 
the analysis is primarily to draw conclusions pertaining to each of the 
] groups, then it is appropriate to use the conventional analysis-of- 
covariance model. 


2. If the groups are regarded as a sample from a (real or hypothetical) pop- 
ulation and the objective of the analysis is to make inferences about 
this population, then the random coefficients model is appropriate. 


3. The fixed effects model explains all differences among the groups by the 
fixed effect adjustments (through the use of indicator or dummy vari- 
ables to represent the group-level adjustments) to the intercept coef- 
ficient of Equation (7.10). This implies that there is no between-group 
variability left that could be explained by group-level variables. 


` In applications to cross-classified data from a repeated cross-section survey design, the fixed 
effects specification would lead, in parallel to the cross-classified random effects and random 
coefficients model forms of HAPC models, to the class of cross-classified fixed effects model 
(CCFEM) form of HAPC models. 
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Therefore, if the objective of the analysis is to test effects of group- 
level variables, the random coefficient model should be used. The 
exception to this guideline pertains to the case wherein the analyst 
introduces explicitly measured group-level variables that are hypoth- 
esized to account for the group-level effects. In this case, however, 
the model cannot at the same time incorporate indicator variables 
for the group-level fixed effect adjustments. Rather, the analyst must 
assume that the group-level fixed effect adjustments are completely 
accounted for by the explicitly measured group-level variables." 


4. The random coefficients model typically is used with some addi- 
tional assumptions. Most important, as noted, it requires that the level 
2 random residuals be distributed independently of the level 1 regressors/ 
explanatory variables, which implies that corrr, X) = 0. In addition, 
in conventional normal errors HLM models, it is assumed that the 
random coefficients ry and e; are normally distributed. If these assump- 
tions are poor approximations to the characteristics of a specific set 
of empirical data (e.g., the regressors are not independent of the ran- 
dom coefficients, or there is high density in the tails of the distribu- 
tions of the errors), then these assumptions should be modified. 


These, then, are several of the main considerations that conventional statisti- 
cal wisdom indicates should be taken into account in deciding on fixed ver- 
sus random effects formulations of hierarchical statistical models. 

Applied to HAPC models, this reasoning leads to the following guidelines: 


1. The range of the age categories for contemporary human popula- 
tions is essentially fixed at 0 to 125, and most empirical studies uti- 
lize only a part of this fixed range (e.g, the adult ages 18 to 89 in 
the GSS WORDSUM data). Therefore, the individual ages or age cat- 
egories may be regarded as unique entities, and it is reasonable to 
specify the age effects as fixed. 


2. On the other hand, the time period and cohort categories available 
for any specific empirical analysis typically are only a sample of 
periods and cohorts for any human population. Therefore, it also is 
reasonable to specify the period and cohort effects as random. 


3. In addition, a key problem with the fixed effects specification is 
the assumption that the indicator/dummy variables representing the 
fixed cohort and period effects fully explain all of the cohort and 
period effects. That is, the fixed effects model does not allow for 
the possibility of any additional random variance associated with 
the individual cohort and period effects. This implies that there is 


` This assumption is the same as that of the proxy variables approach to identification of the 
conventional APC accounting model; see Section 4.4.4 of Chapter 4. 
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no unexplained between-cohort or between-period variability left 
beyond that captured by the fixed cohort and period effects. In the 
context of HAPC models, this appears not to be the best assumption 
with respect to statistical estimation of the parameters of the models. 
First, the fixed effects model tends to produce substantially larger 
standard errors for the intercept, indicating much more uncertainty 
in the mean estimate (for an empirical example, see Yang and Land 
2008: 314). The CCREM can explain the variance for the intercept 
better than the indicator variables representing the fixed period 
and cohort effects. Second, fixed effects models require estimating 
unique effect coefficients for each period and cohort: (J — 1) + (K - 1) 
parameters in all. Random effects models instead estimate one vari- 
ance parameter that represents the distribution of the random effects 
for the periods and one variance parameter that represents the dis- 
tribution of random effects for the cohorts. The latter usually yields 
a better model fit. Third, repeated cross-sectional survey data tend 
to be highly unbalanced, where unbalanced means that, when the 
sample members in a repeated survey design are cross classified by 
cohort (arrayed, say, in rows) and time period of observation (arrayed 
in columns), the numbers of observations in cells above the diagonal 
are not symmetric with those below the diagonal. And, under such 
data designs, mixed effects models use the available information in 
the data more efficiently and show better statistical efficiency rela- 
tive to fixed effects models (e.g., Duchateau and Janssen 1999). 


All of these considerations point toward advantages of the specification of 
HAPC models as mixed effects models rather than as pure fixed effects mod- 
els. There are exceptions, however. One exception is the situation in which 
a very small number of time periods of repeated cross-sectional surveys, 
say only two or three, is available.’ In this case, it may be quite reasonable to 
specify the effects of time periods as fixed with the cohort effects specified 
as either fixed or random. 

A second exception pertains to situations in which the assumption that 
the level 2 random effects are distributed independently of the level 1 
regressors is not valid. Note that most conventional empirical applications 
of hierarchical linear models proceed without a careful examination of the 
empirical veracity of this assumption. By contrast, the comparative perfor- 
mance of the fixed and random effects model specifications is a standard 
part of model criticism and assessment in longitudinal panel models (often 
referred to as pooled time series cross-sectional models) in economet- 
rics (see, e.g, Greene 2003: 301-303). This is due to the general results in 


` Using a thinned sample of five time periods of GSS verbal test score data and five cohorts, 
Yang and Land (2008) found that the CCREM specification produced results similar to those 
with 19 cohorts and 15 time periods of data. 
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statistical theory for mixed fixed-random effects models that, under the null 
hypothesis of zero correlation between the individual-level regressors and 
the contextual effects coefficients, both the OLS estimator of the individual- 
level coefficients in the fixed effects model and the REML estimator of those 
coefficients in the random effects model are consistent, but the OLS estima- 
tor is inefficient. Therefore, under the null hypothesis, the two estimators 
should produce estimates of the individual-level coefficients that do not dif- 
fer systematically. 

To describe this and a corresponding statistical test for the tenability of 
the independence assumption in HAPC models in more detail, consider the 
cross-classified fixed effects model (CCFEM) corresponding to the CCREM of 
Equations (77)-(79) where the effects of the cohorts uj, j = 1, ..., J and the 
effects of the time periods (years) of the surveys Gu, k = 1, ..., K are assumed 
fixed and unique to each of the respective cohorts and period rather than 
variable and random. In practice, the fixed effects of the cohorts and periods 
are estimated by the incorporation of two sets of indicator/dummy variables 
for J - 1 cohorts and K - 1 periods. Therefore, Equation (78) changes to 


19 15 
Boj = Yo + wy, Cohort; + Yx >, Period, (713) 
j=2 k=2 


where the variance in the intercept Bo, is assumed to be completely captured 
by the indicator variables for cohorts and periods. Substituting this expres- 
sion into Equation (77) yields the combined CCFEM: 


WORDSUM;, = Y B AGE; +B,AGE2, + BXEDUCATION x, 
19 15 (7.14) 
+ B4FEMALE;, + BSBLACKs, + Y1; D Cohort; + y», A. Period, + ep 


j=2 k=2 


This fixed effects HAPC specification is in the form of a cross-classified 
analysis of covariance model. Because of the nesting of the individual-level 
observations within the time period and cohort groups and the flexibility 
of differential temporal intervals facilitated by the presence of individual- 
level observations in repeated cross-sectional survey designs as described 
in Section 7.2, however, the CCFEM does not have the underidentification 
conundrum of the classical APC accounting model described in Chapter 5. 
Nonetheless, because the individual-level observations corresponding to 
a specific cohort or time period may have correlated errors or variances 
that differ from those of observations from other periods or cohorts, a 
heteroscedasticity-corrected estimator (White 1980; Greene 2003: 198—199) 
of the variance-covariance matrix should be estimated. A comparison of 


204 Age-Period-Cohort Analysis 


estimates of the resulting standard errors of the regression coefficients with 
the noncorrected estimates permits an analysis of the extent to which the clus- 
tering of the individual-level observations within periods and cohorts affects 
the variances. 

Following along the lines of comparisons of random and fixed effects 
models in longitudinal panel studies, we now can make a similar compari- 
son of the CCREM and CCFEM models for the repeated cross-sectional data 
on verbal ability in the GSS. Note that the present comparison differs from 
standard longitudinal panel designs in that the same individuals are not 
repeatedly surveyed in consecutive waves of the GSS. However, given the 
temporal dimensions embedded in the cohort and time period contextual 
variables as we have defined them, it is important to address the indepen- 
dence assumption explicitly. 

Yang and Land (2008) recommended a two-step procedure for assessing the 
assumption of the independence of the random effects and the individual- 
level regressors. First, estimate both the CCREM and the CCFEM models 
and qualitatively assess the resulting model fits and the parameter estimates 
and performance of each with respect to the data. Second, calculate a statisti- 
cal test by applying a form of what is known in the econometric analysis of 
pooled time series cross-sectional regression models as a Hausman specifi- 
cation test (see Hausman and Taylor 1981; Baltagi 1995). The Hausman test is 
a Wald chi-squared test of the form 


w-x[k]- [p-8] £^ [0-8] (715) 


where, in the present case, b denotes the vector of individual-level regression 
coefficients estimated from the CCFEM model, B denotes the corresponding 
vector of regression coefficients estimates from the CCREM model, and 
>= Var[b] - Var[B] i isthe difference of the variance-covariance matrices of the 
two estimators (the constant term is excluded from all vectors and matrices). 
Under the null hypothesis that the cohort and period random effects in the 
CCREM model are independent of the individual-level regressors, W is dis- 
tributed as chi squared with K degrees of freedom, where K is the dimension 
of the b and B vectors. Applied to the GSS verbal test score data for the years 
1974—2000, Yang and Land (2008: 317-319) found that the null hypothesis was 
not rejected; thus, the level 1 explanatory variables in the CCREM model of 
Equations (77)-(79) can be assumed to be distributed independently of the 
random effects for time periods and cohorts. In addition, Yang and Land 
(2008: 314—418) showed that the weighted averages of the estimated par- 
tial regression coefficients for time periods and cohorts in the CCFEM of 
Equation (714) displayed patterns of changes across the respective periods 
and cohorts that were very similar to those from the CCREM, which is con- 
sistent with the stochastic independence assumption. 
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In sum, the various substantive and statistical advantages of mixed effects 
model specifications of HAPC models described imply that these specifica- 
tions should be used under most circumstances. The exceptions, as indicated 
here, pertain to empirical applications in which there are very small num- 
bers of repeated cross-sectional surveys available for analysis or in which the 
independence assumption of mixed models is not tenable. 


7.5 Interpretation of Model Estimates 


For comparative purposes, Table 7.1 reports baseline OLS estimates of pooled 
repeated cross-sectional regression models without controls for period and 
cohort effects. It shows significant curvilinear age effects as hypothesized 
by Wilson and Gove (1999b). And, consistent with prior research on verbal 
ability, being female is positively associated with one's expected score on 
WORDSUM, whereas being black is negatively associated with the response 
variable. More years of education is associated with greater verbal ability. 
These covariates together explain 3176 of variance in WORDSUM. 

Table 72 reports the parameter estimates and model fit statistics for the 
CCREM [Equation (79)] estimated on the 17 GSS repeated cross-sectional 
surveys. These results were obtained using the REML-EB estimation 
method via the application of the SAS PROC MIXED (see the sample codes). 
Examining first the model fit statistics reported at the bottom of the table, it 
can be seen that the model deviance is very large compared to the degrees 
of freedom of the model, thus indicating a highly significant association of 
the explanatory variables with the WORDSUM response variable. The vari- 
ance components show that most of the variance in WORDSUM is accounted 


TABLE 7.1 


Fixed-Effects Regression Model for Pooled GSS 
WORDSUM Data, 1974-2006, without Controls 
for Period and Cohort Effects (N = 22,042) 


Fixed Effects Coefficient ` se tRatio p Value 
Intercept 6.22 0.02 281.63 «0.001 
AGE? 0.18 0.01 22.99 <0.002 
AGE? -0.05 0.00 -12.10 <0.003 
EDUCATION? 0.36 0.00 86.30 . «0.006 
SEX (1 = male) 0.22 0.02 9.22 <0.004 
RACE (1 = black) -1.06 0.03 -31.04 ` «0.005 

Adjusted R? 0.31 


a Centered around grand means. 


206 Age-Period-Cohort Analysis 


TABLE 7.2 
HAPC-CCREM of the GSS WORDSUM Data: 1974—2006 
Fixed Effects Parameter Coefficient se t Ratio p Value 
INTERCEPT Vo 6.18 0.06 112.50 <0.001 
AGE ñ. 0.03 0.02 1.71 0.087 
AGE By —0.06 0.01 -11.87 «0.001 
EDUCATION B. 0.37 0.00 86.57 «0.001 
SEX By 0.23 0.02 9.49 <0.001 
RACE Bs -1.03 0.03 -30.07 <0.001 
Random Effects Coefficient se t Ratio p Value 
Cohort 
1894 Uy 0.21 0.14 —1.48 0.140 
1895 Uy -0.11 0.12 -0.93 0.353 
1900 Us -0.05 0.10 —0.49 0.625 
1905 Uy -0.29 0.09 -3.27 0.001 
1910 Us 0.02 0.08 0.26 0.797 
1915 Us 0.16 0.07 2.22 0.027 
1920 u; —0.08 0.07 -1.15 0.249 
1925 Ug 0.08 0.07 1.23 0.220 
1930 Ug 0.00 0.07 0.01 0.990 
1935 Uio 0.07 0.06 1.06 0.289 
1940 Uy 0.24 0.06 3.91 <.001 
1945 Uy 0.45 0.06 7.50 <.001 
1950 Hu 0.18 0.06 3.10 0.002 
1955 Vu -0.04 0.06 -0.57 0.568 
1960 Hu 0.00 0.07 0.04 0.970 
1965 Uy6 -0.16 0.07 -2.20 0.028 
1970 ly; -0.14 0.08 -1.70 0.090 
1975 [S 0.00 0.09 -0.01 0.990 
1980 Hu 0.06 0.11 0.55 0.583 
1985 Ung -0.20 0.15 -134 0.180 
Period 
1974 Vu 0.03 0.04 0.77 0.442 
1976 V5 0.06 0.04 1.41 0.158 
1978 Vs 0.00 0.04 -0.04 0.967 
1982 Va -0.01 0.04 -0.36 0.718 
1984 V5 0.02 0.04 0.37 0.709 
1987 A —0.06 0.04 -1.52 0.129 
1988 V; -0.13 0.05 -2.76 0.006 
1989 Vs —0.06 0.05 -1.34 0.182 
1990 Vo 0.02 0.05 0.43 0.670 
1991 Vio 0.04 0.05 0.92 0.358 


1993 Vu 0.00 0.05 —0.09 0.926 
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TABLE 7.2 (continued) 
HAPC-CCREM of the GSS WORDSUM Data: 1974-2006 


Random Effects Coefficient se t Ratio p Value 
1994 Vio 0.02 0.04 0.49 0.623 
1996 Vis -0.06 0.04 -1.52 0.128 
1998 Em 0.04 0.04 1.02 0.306 
2000 Vis 0.01 0.04 0.11 0.915 
2004 Vig 0.04 0.04 0.88 0.381 
2006 Vy 0.05 0.05 1.16 0.247 

Variance Components Variance se z Statistic p Value 

COHORT Za 0.03 0.01 2.56 0.010 

PERIOD te 0.01 0.00 1.49 0.135 

Individual o? 3.12 0.03 104.87 «0.001 

Model Fit 

Deviance 87707.2 df = 21,999 


for by the individual-level regressors. Level 2 variance components results 
indicate that variation by cohorts is 0.03 and statistically significant (p « .01), 
whereas there is little variation by time periods after controlling for age and 
other individual covariates. Examining further the estimated average effect 
coefficients for cohorts (see also Figure 5.3), it can be seen that the estimated 
effects are particularly negative for the 1905-09 cohort and particularly posi- 
tive for the 1940—44, 1945-49, and 1950—54 cohorts. There also is a negative 
trend from the 1960-64 to the 1980—84 cohort. 

Examining next the estimated individual-level coefficients in Table 72, 
it can be seen that the qualitative results are similar to those reported in 
Table 71—a quadratic age effect, a positive effect for females, a negative 
effect for blacks, and a highly significant positive effect for education. Taken 
together, these regressors account for about 3076 of the unconditional level 1 
(individual-level) variance. The estimated regression coefficients and their 
standard errors are numerically quite similar between the two tables for 
the sex, race, and education variables. Estimates for the linear component 
of the quadratic age curve are quite another story, however. The estimated 
coefficient for this term is reduced from a highly statistically significant .18 
in the pooled regression model without controls for periods or cohorts of 
Table 71 to a marginally significant .03 in Table 72, after cohort and time 
period effects are taken into account. This implies that a failure to control 
for the effects of cohort and period variation in vocabulary knowledge could 
lead to large overestimates of the increases in verbal ability that are related 
to aging from young adulthood into the middle-age years. 

In sum, our results lend support for some aspects of both sides of the 
debate on the intercohort decline in vocabulary knowledge in the United 
States. First, the HAPC analyses found evidence in support of the quadratic 
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age effect on vocabulary knowledge hypothesized by Wilson and Gove 
(1999a, 1999b). However, the linear effect (which indicates the extent to which 
the quadratic age curve of vocabulary knowledge increases with age) was 
reduced to statistical insignificance when controls were introduced for the 
random effects of time periods and cohorts. Furthermore, controlling for 
the effects of key individual characteristics in the HAPC analyses (namely 
sex, race, and education) did not explain away all the age effects. We found 
that about 1% of variation in verbal scores at the individual level was due 
to the quadratic effect of aging after controlling for the random effects of 
cohorts and periods as well as the individual-level covariates of sex and 
race. This was about three times the "one-third of 1%” found by Alwin and 
McCammon (2001) in regressions that controlled for cohort effects but not for 
period effects or individual-level covariates. 

Second, we found only evidence of modest time period effects. This sup- 
ports the contentions of Alwin and McCammon (1999) that period effects in 
the GSS vocabulary knowledge data are relatively minor. The presence of 
these effects, however, affects the estimates of age and cohort effects. 

Third, the HAPC analyses found evidence in support of the contentions 
of Alwin (1991) and Glenn (1994, 1999) that there has been an intercohort 
decline in vocabulary knowledge. In fact, we found evidence of a bimodal 
curve of cohort effects. There was evidence of a peak in vocabulary knowl- 
edge for cohorts born in the 1940s and perhaps the early 1950s. But, our anal- 
yses also suggested a deficit for birth cohorts from the first decade of the 
twentieth century. Relative to this early century decline, vocabulary knowl- 
edge showed a secondary peak in the immediately following cohorts, thus 
yielding a bimodal cohort curve not found in previous studies. 


7.6 Assessing the Significance of Random Period 
and Cohort Effects 


The development of HAPC models may provide a useful apparatus for mod- 
eling and estimating distinct age, period, and cohort effects in repeated cross- 
sectional survey designs. In this context, however, the question arises: How 
can one assess or judge the significance of estimates of cohort and period 
effects in such models?" This question may be addressed by examining the 


` Parts of Section 7.6 are adapted from Frenk, S. M., Y. Yang, and K. C. Land. Assessing the 
significance of cohort and period effects in hierarchical age-period-cohort models with applications to 
verbal test scores and voter turnout in U.S. Presidential elections. Under review. 


Hierarchical APC-Cross-Classified Random Effects Models, Part I 209 


statistical significance of the estimated effect coefficients for each individual 
cohort and time period in a study. But, it may be the case that, say, some 
cohorts have statistically significant effect coefficients and some do not, and 
the same may be true for the estimated period coefficients. In such a case, 
how does one assess the overall statistical significance of the cohort or the 
period effects? Beyond statistical significance is the question of substantive 
significance. It could be the case, for example, that most of the individual 
estimates of cohort or period effects are not statistically significant at a con- 
ventional level of significance, but they exhibit an interesting trend or pat- 
tern that merits substantive interpretation. 

In the context of empirical applications of LM and GLMM specifications 
of HAPC models, we describe a two-step approach and set of guidelines to 
address these questions that build on a large body of literature on methods 
for hypothesis testing in mixed (fixed and random effects) models in statis- 
tics. We claim no originality for these general statistical methods. Rather, the 
object of this section is to organize them into a set of methods specifically 
adapted to the features of HAPC models and to illustrate their application 
in the context of empirical analyses of two specific datasets: GSS data on 
trends in verbal ability and NHANES data on trends in obesity. In addition, 
substantive findings from the empirical applications clearly demonstrate the 
dominance of cohort effects in the former case and period effects in the latter 
and thus help to resolve long-standing empirical questions and disputes in 
each case. The procedure and guidelines that are articulated and illustrated 
in these two empirical analyses can be readily adapted and applied more 
generally to other empirical HAPC analyses. 


7.6.1 HAPC Linear Mixed Models 


To assess and test the statistical significance of the cohort and time period 
effects estimated from the HAPC-CCREM analysis of the GSS data shown in 
Table 7.2, we suggest a two-step approach. 


7.6.1.1 Step 1: Study the Patterns and Statistical Significance of the 
Individual Estimated Coefficients for Time Periods and Birth Cohorts 


As an initial step, the individual estimated period and cohort effects should 
be studied both for substantively meaningful patterns and statistical signifi- 
cance. This can be done in two parts. 

Step 1.1: Graphically Plot the Temporal Sequences of Estimated Cohort 
and Period Effect Coefficients 

While the numerical values of the estimated cohort and period effects 
in Table 72 contain the same information, as a first step in the analysis of 
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Predicted Verbal Test Score with 95% 
Confidence Bounds 


Cohort 


Predicted Verbal Test Score with 95% 
Confidence Bounds 


Period 


FIGURE 7.1 
Estimated cohort and period effects and 95% confidence bounds for GSS verbal ability model. 


their substantive and statistical significance, we also graphically plotted the 
estimates. This facilitates a quick visual check of the extent to which the 
estimated effects exhibit temporal trends or patterns that are of substan- 
tive significance. Particular periods or birth cohorts that stand out also may 
be identified. 

Figure 71 contains graphs of the temporal sequences of the estimated 
cohort effects Ge, the estimated Bo; = Yo + uo; cohort verbal test score effects 
averaged over all time periods for each cohort j) and the time period effects 
De, the estimated Bo, = Yo + Vog period verbal test score effects averaged over 
all cohorts for each time period k) with their 95% confidence bounds. Each 
graph also has a horizontal line at 6.175, the numerical value of the estimated 
Yo or intercept coefficient reported in Table 72. This line facilitates a visual 
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inspection of those cohorts and time periods, if any, that have substantial 
deviations from the overall intercept. 

With 20 and 17 observations, respectively, the 95% confidence bounds for 
the birth cohort and time period effects are relatively broad. The pattern of 
the estimated time period effects does not show substantial variations or sys- 
tematic temporal patterns, with only the 1988 value deviating substantially 
from the overall average. By comparison, it can be seen that the estimated 
temporal pattern of birth cohort effects contains some gyrations that are 
quite pronounced and relevant to substantive debates concerning historical 
trends in verbal ability, which we have already discussed. 

Step 1.2: Examine the Statistical Significance of Individual Cohort and 
Period Effect Coefficients 

Turning from visual and substantive assessments of estimated cohort and 
period effects, the next step is to examine the statistical significance of the 
individual effect coefficients for the birth cohorts and time periods—the esti- 
mates of the uy; and the vp, random effects with the null hypothesis in each 
case being that the respective coefficient is zero, that is: 


Ho:uo;= 0 versus H, : uoj £0, and Ho: vox =0 versus H,: 0% # 0. 


If these null hypotheses are not rejected, this implies that the mean of the 
WORDSUM outcome variable for the jth time period or the kth cohort is no 
different from the overall average. The coefficients and their standard errors 
for the birth cohorts and time periods in Table 72 were estimated by the 
REML-EB method, and their ratios can be interpreted as asymptotic/large- 
sample f ratios in the conventional way (Raudenbush and Bryk 2002: 57-58). 

As will often be the case with HAPC models, some cohorts and time peri- 
ods may have statistically significant effects, as measured by t ratios, while 
others do not. For instance, in Table 72, the effect coefficients for the 1905-9, 
1915-19, 1940—45, 1945—49, 1950-54, 1965-69, and 1970-74 cohorts are statisti- 
cally significant at conventional levels, whereas those for the other cohorts 
are not. For time periods, only the effect coefficient for 1988 attained conven- 
tional levels of statistical significance. 

These assessments of statistical significance of the individual cohort and 
period estimated coefficients are consistent with the graphical representa- 
tions in Figure 71. Given that the estimated cohort and period coefficients 
are displayed in the figure with their 95% confidence bounds, there is a cor- 
respondence between those cohort and period coefficients with asymptotic 
t ratios that are statistically significant at the .05 level in Table 6.2 and those 
for which the 95% confidence bounds do not cross the 6.175 horizontal line 
in Figure 7.1—that is, the 1905-9, 1915-19, 1940-45, 1945-49, 1950-54, 1965—69, 
and 1970-74 cohorts and the 1988 period. In general, however, even when 
none of the individual birth cohort and time period coefficients are statisti- 
cally significant, it often is useful to examine graphically the patterns of each 
set of coefficients for trends that could be of substantive interest. 
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7.6.1.2 Step 2: Test for the Statistical Significance of the Period 
and Cohort Effects Taken as a Group 


When some cohort and period effect coefficients attain statistical significance 
but some or most do not, we next may address the question of whether the 
birth cohort or time period effects, taken as a set, are statistically significant. 
This question is one of whether these effects, taken together, contribute to 
explained variance in the model. 

Step 2.1: Deviance and Variance Components Analysis 

A first approach to answering this question is to study the model devi- 
ance statistic and variance components. The deviance is defined as minus 
twice the natural logarithm of the likelihood of an estimated model and can 
be regarded as a measure of lack of fit between model and data. In Table 7.2, 
the bottom rows show that the deviance statistic is very large compared 
to the degrees of freedom of the model, thus indicating a highly significant 
association of the explanatory variables with the WORDSUM response vari- 
able. In addition, the variance components show that most of the variance 
in WORDSUM is the individual-level regressors at level 1. Level 2 variance 
components results indicate that variation by cohorts was statistically sig- 
nificant, whereas that for time periods was not. This variance component 
analysis based on z scores is consistent with the results noted for the individ- 
ual cohort and period coefficients. That is, a sufficient number of estimated 
cohort effects were statistically significant for the cohort variance component 
to attain statistical significance. But, since only one of the time period effects 
was statistically significant, the overall contribution of the random effects for 
time periods was not sufficiently large for its variance component to attain 
statistical significance. 

Step 2.2: An F Test for the Presence of Random Effects 

The previous results were obtained using REML, which rests on the 
assumption that the error terms are asymptotic normally distributed and 
yield random effect estimators with good large-sample properties. When 
the number of level 2 units, in this case cohorts and periods, is not large, 
this assumption may not be appropriate. And, the z scores for the REML 
estimates of the variance components are only proximate. To further test 
whether the birth cohort and time period effects make statistically signifi- 
cant contributions to explained variance in an outcome variable, a general 
linear hypothesis may be applied. Specifically, one can use an F test to test 
the hypothesis of the presence of random effects. The sampling distribution 
of the F statistic is exact when the random effects are independently distrib- 
uted as normal random variables. This test statistic is preferred over the z 
score when the sample sizes for random effects are small (Littell et al. 2006). 
The statistical theory for such tests has been developed in a very general 
LMM context (Demidenko 2004). 

In the present case, for the CCREM model of Equation (79), there are only 
two sets of random effect coefficients that are estimated, namely, the set of 


Hierarchical APC-Cross-Classified Random Effects Models, Part I 213 


residual random effects of cohort j, Hj, and the set of residual random effects 
of period k, oy. Each of these sets of random coefficients is assumed to be 
independently, normally distributed with mean 0 and variances 7, and 7,, 
respectively. Thus, for a CCREM model with random intercepts, the exact F 
test amounts to testing null hypotheses for the relevance of either the birth 
cohort random effects: 


Hy T, = 0 versus H; t, > 0 (7.16) 
or the time period effects: 
Hy: t, = 0 versus H; T, > 0. (717) 


Alternatively, one can test for the joint relevance of both the cohort and the 
period effects: 


Hy T, = T, = 0 versus H; T, > Ü or t, > 0 (718) 


These null hypotheses correspond to situations, respectively, in which the 
levels of variation in the cohort effects, the period effects, and the cohort and 
period effects taken together do not differ significantly from zero. 

The results of the F tests for hypotheses (716)-(7.18) for the GSS data are 
summarized in Table 73. Consider the case of the null hypothesis (716) in 
which the birth cohort effects are not relevant to explaining variation in the 
verbal test score outcome variable of Equation (79). The idea of the F test is 
that, when the variance of the random birth cohort effects t, = 0, the dif- 
ference between the minimum sum of squares (SS) of the model (79) with 
random cohort and period effects Smin and the minimum SS without the 


min 


TABLE 7.3 
F-tests for the Presence of Random Effects, GSS WORDSUM Data (1974—2006) 
Cohort Effects Period Effects Cohort and Period Effects 


t,=Ovs.t,>0 t,=Ovs.t,>0 Tt, = T, = O VS. T, ort, > 0 
Sors 69,377 69,377 69,377 
S nin 68,696 69,268 68,558 
R 25 22 42 
M 5 5 5 
Nr 22,042 22,042 22,042 
(Sors — S,in) / (r — m) 34.05 6.41 22.14 
Smin/ (Nr - 1) 3.12 3.15 3.12 
F 10.9 2.03 7.10 


elt —m, Nr — r) 1.57 1.62 1.41 
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random effects, as estimated by an OLS regression on the level 1 explanatory 
variables Sors, should be close. Accordingly, we compute the residual SS: 


N 
Sors = 3 


i=1 


Hr X Bors (7.19) 


for an ordinary fixed effects regression model that assumes no random 
effects of cohorts or time periods, as shown in the pooled regression model 
without controls for periods or cohorts of Table 7.1. Next, the minimum SS in 
the presence of the random effects, that is, the minimum 


Sming = mins |y - Wë|° (7.20) 


where the matrix W consists of the matrix X of observations on the individual- 
level explanatory variables adjoined with a design matrix Z for the random 
cohort effects, that is, W = [X, Z], and ó = (Bu). In this example, Sors = 69,377 
and Smin = 68,696. Under the null hypothesis (710), it can be shown that the 
ratio of two quadratic forms has an F distribution, or more precisely: 


(Sors — Bois )/(r — m) 
Sri /(Nr —r) 


— F(r —m,Nr — r) (7.21) 


where N; denotes total sample size, r is the rank of the matrix W, and m 
is the number of explanatory variables in the OLS regression (Demidenko 
2004: 137). When random cohort effects are present in a LMM model, such as 
Equation (79), that is, when t, is nonzero, Smin should be relatively small so 
that the ratio (715) becomes large. Thus, we reject the null hypothesis (7.10) if 
the left-hand side of (7.15) is large. More precisely, let 1 — « be a chosen signifi- 
cance level, for example, o = 0.05, and foss be the quantile of the F distribution 
with r — m and Nz- r degrees of freedom. Then, the H, is rejected when the 
ratio in (715) exceeds f. os. 

To apply the F test (715) to model (79), note first that, under the assumption 
that the explanatory variables in the X matrix are linearly independent, the 
rank of X is m, and the number of explanatory variables is five in this case. In 
addition, since individuals in the pooled GSS data may be members of differ- 
ent birth cohorts, the columns of the design matrix Z for the random effects 
will be linearly independent and thus have rank 20. Therefore, in the numer- 
ator of (715), r = 25 and m = 5, which gives r — m = 20 degrees of freedom. 
In the denominator, Nr — r = 22,042 — 25 = 22,017. Under the null hypothesis 
that the cohort effects have zero variance in the GSS verbal test score analy- 
sis, the F ratio is 10.9. With 20 and 22,017 degrees of freedom, this far exceeds 
the critical value fo; = 1.571. The corresponding F ratio for hypothesis (7.11) of 
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period effects is 2.03, which also exceeds the critical value of 1.623. Thus, we 
reject the null hypotheses that the variance of either the birth cohort random 
effects or the period random effect is zero and conclude that inclusion of 
these sets of random effects is relevant to the explanation of variation of the 
GSS verbal test score data. In addition, the F test can be applied to the sets of 
random effect coefficients taken as a whole, that is, to test the null hypothesis 
(712). In this case, the Z matrix is expanded to include both u$ and v, which 
changes the rank of W to 42 (= 5 + 20 + 17). The F ratio is 7.096, which is sig- 
nificant at the .05 level. 

The foregoing analyses indicate that there is evidence that the two sets of 
random effects taken together contribute significantly to the explained vari- 
ance. Note that the z score reported in Table 72 for the variance component 
for period effects is 1.49 (p = .135), indicating failure to reject the null hypoth- 
esis of zero variance for the period coefficients. The F tests described here 
indicate the opposite, that is, that the period effects contribute significant 
variability that should not be ignored in the model. Because there are only a 
few levels of the period random effects, the F SS method is a more statistically 
sensitive method for testing hypotheses about the variance components than 
the z-score method. In particular, the F tests typically will indicate statistical 
significance of either the cohort or the period effects, taken as a whole, when 
at least one of the members of these sets of effects is statistically significant, 
as is the case for the estimated period effects reported in Table 7.2. 

Statistical significance does not, of course, equate to substantive impor- 
tance, but it does indicate a contribution to explained variance. Substantively, 
taking into account all of the foregoing assessments of individual coefficients 
and sets of coefficients, it must be concluded that, while there is evidence of 
statistical significance of one time period effect, and while this is sufficient to 
conclude that the period effects make a statistically significant contribution 
to explained variance, the dominant explanation on trends in verbal ability, 
as measured by the GSS WORDSUM data, is a cohort as opposed to a period 
one. That is, net of the effects of individual-level covariates in accounting for 
temporal trends in individuals' verbal test scores, cohort effects are much 
more prominent than period effects, and researchers should indeed study 
cohort-based explanations for these trends. 


7.6.2 HAPC Generalized Linear Mixed Models 


The HAPC approach to modeling age, birth cohort, and time period effects 
developed by Yang and Land (2006) is not restricted to applications to nor- 
mally distributed outcome variables that can be modeled by a Gaussian 
LMM model specification. Rather, the HAPC approach can be applied to 
dichotomous and multiple categorical outcome variables. For such outcome 
variables, the HAPC framework takes the form of GLMM specifications. Our 
suggested approach to testing for the statistical significance of the random 
effects again has two steps. 
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To illustrate the significance testing guidelines for the class of GLMM 
formulations of the HAPC-CCREM mode, we study an application of one 
of the most important classes of GLMM models, namely, that of a logistic 
response function for a dichotomous outcome variable. Specifically, consider 
the example of obesity epidemic in the United States and the contributions 
of time period and birth cohort effects to the epidemic. For this analysis, 
we pooled data from the NHANES 1971-2008 values on obesity status and 
a number of individual-level covariates that may affect the probability of 
being obese. We focus for now on key social demographic and SES variables 
as other risk factors, such as health behaviors and diet, are not available for 
all survey years, and their inclusion would substantially reduce the sample 
size. To model the likelihood of obesity, we specified the following HAPC 
mixed model: 


Level 1 or ^within-cell" model: 


Logit Pr(OBESE;, = 1) = Boz T Bi, AGE ii. + B» AGE, + B3SEX i. + BaRACE ix 
(7.22) 
+ BsEDUCATION ix. + BsINCOME i 


Level 2 or “between-cell” model: 


Dun = Yo + oj + Ont , Uoj e N(0,T,) s Vok ~ N(0,1,) (723) 


Combined model: 


Logit Pr(OBESE;, = 1) = Yo + D AGE ii + Br AGE + B3SEX i. + BsRACE ix 


+ BSEDUCATION ig: + B4INCOME jx (7.24) 
= Uoj + Vok 
for 
i= 1,2, ..., ny individuals within cohort j and period k; 
j= 1, ..., 17 birth cohorts; 
k= 1, ..., 9 survey years. 


This model, similar to that for the GSS verbal test score, specifies that 
the intercept or the expected mean has random period and birth cohort 
effects. The level 1 model, however, is specified in terms of the logit of the 
probability of being obese (p) modeled as the log-odds of obesity, logit(p) = 
log[p/(1 — p)], that is, as a logistic response function. This moves the HAPC- 
CCREM model from the LMM family of statistical models into the GLMM 
family. As indicated by previous studies of GLMMs, we typically assume 
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multivariate normality for random effects (Raudenbush and Bryk 2002). As 
described previously, we centered the age variable around the grand mean. 

Table 74 reports parameter estimates and model fit statistics for models 
(7.22)- (7.24) obtained from the SAS PROC GLIMMIX (Littell et al. 2006) (see 
the sample codes). All individual-level covariates had statistically signifi- 
cant coefficients that were consistent with prior research. Specifically, the 
effect of age was curvilinear, indicating increases in the risk of obesity with 
age that decelerated at older ages. Males, whites, those with some college 
education or college degrees, and those in the highest income quartile were 
less likely to be obese than females, blacks, those with 12 years of educa- 
tion or less, and those in the middle income quartiles. People in the lowest 
income quartile are more likely to be obese than others. Beyond the effects 
of the individual-level covariates of this model, what is interesting about the 
results in Table 74, however, is that, as contrasted to those for the GSS verbal 
test score data described previously, of the two sets of random effects, period 
effects are more relevant to the explanation of obesity risk than are birth 
cohort effects, as measured by contributions to explained variance. 

To reach this conclusion, we applied the same sequence of graphical dis- 
plays and statistical tests of significance as identified for the LMM form of 
the HAPC-CCREM model. 

As in Step 1.1, graphically plot the temporal sequences of estimated 
cohort and period effect coefficients. As was the case for the estimated ran- 
dom effects of time periods and cohorts in LMM HAPC models, a first step is 
to examine graphical displays of the temporal patterns of the effects. 

Symmetrically with Figure 7.1, Figure 72 contains graphs of the temporal 
sequences of the estimated cohort effects and the time period effects with 
their 95% confidence bounds. The cohort effect is calculated as Bo; = Yo + Uo, 
where % = —0.587 is the intercept or estimated overall mean and Vue are the 
cohort-specific random effects coefficients, and converted to probabilities 
of obesity = exp(Bo)/(1 + exp(Bo)). The period effect is calculated as Bu = 
Yo + Vox, Where Vo are the period-specific random effects coefficients, and 
converted to probabilities of obesity. The graphs also have a horizontal line 
at 0.363, the transformed probability of obesity of the intercept estimate of 
Yo, of sample respondents at mean age and in the reference group. In con- 
trast to what was observed for the GSS verbal test score cohort estimates in 
Figure 71, the pattern for estimated cohort effects in Figure 72 is relatively 
constant, with cohorts born after 1935 showing more higher-than-average 
probabilities of obesity and a slightly more apparent upward trend for 
those born between 1960 and 1974. The graph of the estimated obesity time 
period effects, on the other hand, shows quite pronounced variations. The 
average probabilities of obesity increased rapidly from the early 1970s to 
2000 and gradually but continuously increased until 2008. 

Following the procedure of Step 1.2, examine the statistical significance 
of individual cohort and period effect coefficients. Of the individual esti- 
mated random effect coefficients for birth cohorts and survey periods given 
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TABLE 7.4 
HAPC-CCREM of Obesity Trends, NHANES 1971-2008 
Fixed Effects Parameter Coefficient se t Ratio p Value 
Intercept Yo -0.56 0.16 -3.56 0.007 
AGE B. 0.01 0.00 8.11 <0.001 
AGE2 p. —0.00 0.00 -11.06 «0.001 
SEX (male - 1) B, —0.42 0.03 -17.31 «0.001 
RACE (black - 1) By 0.46 0.03 16.49 <0.001 
EDUCATION Bs 
SOME COLLEGE (13-15 years) -0.08 0.03 -249 0.013 
COLLEGE (2 16 years) —0.40 0.04 -9.74 «0.001 
FAMILY INCOME p. 
LOWEST QUARTILE 0.13 0.03 4.51 «0.001 
HIGHEST QUARTILE -0.15 0.03 —.37 <0.001 
Random Effects Coefficient se t Ratio p Value 
Cohort 
1899-1904 UM 0.02 0.04 0.60 0.550 
1905-1909 Uy -0.03 0.04 -0.79 0.432 
1910-1914 Uz 0.03 0.04 0.88 0.378 
1915-1919 Uy 0.00 0.03 0.08 0.936 
1920-1924 Us —0.03 0.03 —0.82 0.411 
1925-1929 Ve 0.00 0.03 0.10 0.917 
1930-1934 Uy -0.02 0.03 -0.58 0.561 
1935-1939 Ug 0.03 0.03 0.86 0.390 
1940-1944 Uy 0.02 0.03 0.70 0.483 
1945-1949 llo 0.01 0.03 0.17 0.065 
1950-1954 Uy —0.04 0.03 -1.14 0.253 
1955-1959 1» 0.01 0.03 0.15 0.879 
1960-1964 ls -0.06 0.03 -1.64 0.102 
1965-1969 lla -0.01 0.04 -0.13 0.895 
1970-1974 s 0.04 0.04 1.12 0.264 
1975-1979 U6 0.02 0.04 0.46 0.642 
1980-1982 uz —0.01 0.04 —0.17 0.864 
Period 
1971-1975 MI —0.73 0.16 —4.66 «0.001 
1976-1980 V —0.70 0.16 —4.45 <0.001 
1989-1991 V3 —0.26 0.16 -1.61 0.107 
1991-1994 Va —0.04 0.16 —0.26 0.794 
1999—2000 Vs 0.34 0.16 2.08 0.038 
2001-2002 Ve 0.26 0.16 1.60 0.109 
2003-2004 V; 0.36 0.16 2.23 0.026 
2005-2006 Vs 0.41 0.16 2.58 0.010 
2007-2008 Vo 0.37 0.16 2.33 0.020 
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TABLE 7.4 (continued) 
HAPC-CCREM of Obesity Trends, NHANES 1971-2008 


Variance Components Variance se z Statistic p Value 
COHORT Tuo 0.00 0.00 1.05 0.147 
PERIOD Too 0.22 0.11 1.99 0.024 
Model Fit 

-2 Res Log Pseudo-Likelihood 186586.7 — df=40228 


2 
Rel 
Ó s 064 
2s 
32 054 
Ed 
sg 
[el 
ZS 0.4 - 
KE 
= = 
= Š oj J 
BO 
° = 
SE 0.2 4 
EK 
37 OI T T T T T T T T T T T T T T T T 1 
2 dO hä ads QO od oO sd oh Q ds Q ad A d 
o o Y o S o S o e o NA o I o Y o S 
SPY OY OY O O O od? o9 IF FO” Ai S oj 
ONIYI III DD DS A A ES: 
Cohort 


0.4 4 


° 
N 
l 


with 95% Confidence Bounds 


Predicted Probabilities of Being Obese 
° 
Ë 


° » 9 » © d 
L S ër q^ SF gv qu < iS 
zx zy Se Fd Bi n " z Z. 
Y ab éi ey qu € KF YF < 
> S Y Zë % % 3$ 
Period 


FIGURE 7.2 
Estimated cohort and period effects and 95% confidence bounds for NHANES obesity model. 
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in Table 74, it can be seen that only one cohort, the 1960-64 birth cohort, had 
a t ratio that is statistically significant at the o = 0.10 level. By comparison, the 
time period coefficients for the majority of survey periods were significant 
at the ot = 0.05 level. 

Perform deviance and variance components analysis as in Step 2.1. As 
was the case for the GSS verbal test score data, the deviance statistic reported 
at the bottom of Table 74 shows that the full CCREM model explains much 
variance in the obesity outcome variable. In contrast to the verbal test score 
example, however, the variance component analysis indicated that the set of 
period random effect coefficients had a more statistically significant contri- 
bution to explaining obesity than the set of birth cohort coefficients. 

Per Step 2.2, perform an F test for the presence of random effects. The 
model of Equations (7.22)-(7.24) is that of a logistic regression model with a 
random intercept. Demidenko (2004: 374—375, 408—409) showed that the F test 
(7.21) can be generalized to develop an asymptotic F test of the null hypoth- 
esis that the intercepts are constant or homogeneous in a logistic regression 
model with random intercepts. We next describe and apply this test. 

To generalize the F test (7.21) and to specialize the homogeneity test of 
Demidenko (2004: 374—375, 408) to GLMM formulations of HAPC-CCREM 
models, recall that the deviance statistic (twice the negative log-likelihood 
function l) asymptotically behaves as the SS (McCullagh and Nelder 1989). 
Given this, 5 in the test statistic (7.21) can be replaced by -21 to obtain 


(Io Wes Ioax)/ (r ei m) 
Toi: /(Nr ~ r) 


= Fr —m,Nr — r) (7.25) 


where l, is the maximum of the log-likelihood of the standard level 1 logistic 
regression model with no controls for cohort and period effects, and Lmax is the 
maximum of the log likelihood treating the cohort and period effects as fixed 
parameters. Demidenko (2004: 54-55) showed that a fixed effects model that 
treats the random effects as fixed corresponds to a random effects model 
with infinite covariance matrix. Thus, lmax is an upper-bound estimate for 
the SS of the mixed model. And, as in (721), Nr denotes total sample size, r is 
the rank of the matrix W, and m is the number of explanatory variables in the 
level 1 model. 

To apply the asymptotic F test (7.25) to the NHANES obesity model (7.24), 
note that m — 8, the number of explanatory variables in the level 1 logistic 
regression model, and r = 34 = m + the number of birth cohorts + the number 
of election time periods = 8 + 17 + 9. Then, the value of the F ratio for the 
statistical significance of the birth cohort effects alone is 


F = 47.8, p < 01 
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that for the statistical significance of the period effects alone is 
F = 96.8, p < .01 

and that for the statistical significance of cohorts and periods together is 
F =34.8, p < .01 


Numerical details for these F tests for hypotheses (722)-(724) for the 
NHANES data are given in Table 7.5. 

In agreement with the variance components analysis using the z statis- 
tics reported in Step 2, these F-test results show that the incorporation of 
time period random effects into the model produced a statistically signifi- 
cant variation in the level 1 model intercept. Different from the z-test result, 
however, incorporation of birth cohort random effects also produced sig- 
nificant variation in the intercept by the standard of the F test. In addition, 
the incorporation of both sets of effects produced a statistically significant 
F ratio. These Step 4 results corroborate the inferences regarding the period 
effects on obesity from Steps 1, 2, and 3 and suggest an additional source of 
variation contributed by birth cohort. We note that to estimate the mixed 
logistic models, existing software programs like SAS use a pseudo-ML esti- 
mation algorithm in which a consistent and asymptotically normal estimate 
of certain model parameters is computed rather than a ML estimate. In such 
contexts, the log-likelihood functions necessary to calculate the asymptotic 
F test may be only approximate. Substantively, taking into account all of the 
foregoing assessments of individual coefficients and sets of coefficients, it 
must be concluded that, while there is some evidence of statistical signifi- 
cance of birth cohort effect, the main story on trends in obesity epidemic in 


TABLE 7.5 
F-tests for the Presence of Random Effects, NHANES Data (1971-2008) 


Cohort Effects Period Effects | Cohort and Period Effects 
t,=Ovs.t,>0 t,=O0vs.t,>0 Ç, = T, = 0 vs. T, ort, > 0 


l 43313 43313 43313 
PN 42456 42395 42359 
r 25 17 34 
m 8 8 8 
Nr 40,261 40,261 40,261 
(ly — lnax)/ (r — m) 50.41 102.00 36.69 
Las (Nr f) 1.06 1.05 1.05 
F 47.78 96.82. 34.85 


foss(r — m, Nr — r) 1.62 1.88 1.50 
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the U.S. population, as measured by the objective body mass index (BMI) 
data, is a period story rather than a cohort story—the opposite of what we 
concluded for trends in verbal ability. 

Future applications of the HAPC model in the similar forms as specified 
for the GSS and NHANES data are likely to find some instances in which 
either time period or cohort effects are effective, and there may also be 
instances for which both types of contextual effects may be operative. Use of 
the HAPC modeling apparatus and decision guidelines to assess statistical 
and substantive significance described in this section should be quite fruit- 
ful in disentangling the age, period, and cohort sources of temporal change 
in such studies. 

It merits noting that the second part of Step 2 in our strategy, that of an 
application of a formal statistical test of homogeneity of the estimated ran- 
dom effect coefficients of cohort and period effects, may become more chal- 
lenging or even intractable when more complicated versions of GLMMs are 
used for the HAPC analysis. The comparisons of the z- and F-test results 
based on the two models suggest that the former tends to yield results more 
conservative than the latter, while the benefit added by the application of the 
F test is the detection of the combined random components of period and 
cohort. As these tests have only been developed for the random intercept 
models, we recommend the analyst make a cautious assessment of whether 
the F test can be reasonably applied and interpreted in other kinds of mixed 
models, such as the random coefficients APC models we introduce next. 


7.7 Random Coefficients HAPC-CCREM 


The mixed regression models approach not only is methodologically rele- 
vant for APC analyses but also enhances our ability in addressing additional 
questions that cannot be addressed in linear fixed effects models but bear 
theoretical importance to studies of social change and heterogeneity. First, 
the basic random intercept models (79) and (724) introduced specify fixed 
effects of all individual-level covariates. The results of the analyses of the 
GSS verbal test score data and the NHANES obesity data have shown that 
the fixed effect coefficients of these covariates are statistically significant, and 
their inclusion also accounts for some of the age effects and variance compo- 
nents. Could these level 1 coefficients also have random variation, just as the 
intercept, such that the associations between individual-level covariates and 
the outcome may vary depending on period and birth cohort? While classi- 
cal fixed effects linear regression/accounting APC models are often confined 
to the specification of age effects, the HAPC-CCREM can more easily specify 
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temporal variations in other individual-level covariates that may substan- 
tially broaden the scope of investigations. In the example that follows, we 
extend the NHANES obesity analysis in the previous section to show how 
the specification of random coefficients models enables us to detect increas- 
ing social demographic differentials in obesity rates across birth cohorts. 

The results of the previous analysis suggested weak cohort variations in the 
overall intercept or mean probability of obesity. Given findings from an earlier 
study that suggested population heterogeneity in the period and cohort etfects 
from the National Health Interview Survey (NHIS) data (Reither, Hauser, and 
Yang 2009), we further modeled random effects of level 1 effect coefficients. That 
is, we tested the hypothesis about whether sex, race, or socioeconomic status 
(SES) disparities in obesity varied significantly by time period or birth cohort. 

The level 1 model specification is similar to Equation (7.22) but allows the 
level 1 coefficients to have random effects: 


Logit Pr(OBESE ix = 1) = Boje + Big AGE; + Boje AGE + Bs SEX i (7.26) 
F Ba RACE; + Bs EDUCATION + B | INCOME; 


The level 2 model is then revised as follows: 


Sex effect: 

Pajk = Ya + U3j + Oak (7.27) 
Race effect: 

Pajk = Yat Maj + Vax (7.28) 
Education effect: 

Bs jk = Y5 + Us; + Usk (7.29) 
Income effect: 

Boje = Yo Ue; + Vox (7.30) 


These models tested the hypotheses about period and cohort effects 
through the specifications of random variance components for the random 
coefficients for sex, race, education, and income effects. The age effects could 
potentially also vary from cohort to cohort or from period to period. In this 
analysis, however, such random age effects were not statistically significant 
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or of theoretical interest. We therefore could constrain the age effects to be 
fixed just as in the analysis of the GSS verbal test score data in Section 75. 
Therefore, we have omitted the level 2 random effects models for these two 
coefficients, which yielded B,;, = Y, and B; = v. In this set of equations, y; to 
Ys are the level 2 fixed effects coefficients that represent the fixed effects of 
sex, race, education, and income, respectively. In the level 1 model, we can 
test the life course hypotheses about age variations in social gaps in obesity 
by including interaction terms of age with each of the other level 1 covari- 
ates. In fact, the specification of level 1 model that accounts for the functional 
form of level 1 covariates' effects across age preceded that of the level 2 
model. This process is explicated in the analysis of happiness in the next 
chapter. To test whether these differentials varied by time or birth cohort, 
the equations specified that their coefficients have cohort effects 14 — Ver and 
period effects v3, — v4, whose corresponding random variance components 
are T,,3 — T, and T,3 — Le, Based on the combined models of (7.26)-(7.30), we 
next estimated logit CCREMs of obesity using SAS PROC GLIMMIX (Littell 
et al. 2006) (see the sample codes). 

Table 76 summarizes the results of the random coefficient model esti- 
mates. The fixed effects estimates were largely similar to those reported in 
the previous analysis using the random intercept models (Table 74). Also 
similar to the previous analysis, the estimates of random effects in terms 
of residual variance components at level 2 indicated significant period but 
not cohort effects on the overall intercept. Level 1 coefficients of two covari- 
ates—sex and race—showed significant random effects, whereas those of 
SES variables did not. There seem to be cohort changes in sex gaps in obesity 
and period changes in sex and race gaps in obesity that are significant at 
a = .10 level. The race gaps showed larger variations across cohorts than time 
periods that are significant at o = .05 level. To see more clearly how specific 
sex and race groups changed in the risk of obesity across cohorts, we display 
the estimated random cohort effects in Figure 73 in terms of the predicted 
probabilities of being obese for each cohort at the mean age and averaged 
over all years by sex and race. 

For the reference group of white females with 12 years or less of education 
and in middle income quartiles, the predicted probabilities were calculated 
the same way as those for Figure 72. For black males, we first calculated the 
random effect coefficients as B; + Bs; + Bay = Yo + Ys + % + Uoj + us; + uy (where 
Yo = —0.563 is the intercept or estimated overall mean; $, = —0.424 is the esti- 
mated fixed sex effect coefficient; y, = 0.458 is the estimated fixed race effect 
coefficient; uy are the intercept cohort random effects; us; and uu; are the sex 
and race-specific random effects, respectively) and converted them to pre- 
dicted probabilities. Similar calculations yielded results for white males and 
black females. The figure shows that black cohorts experienced more pro- 
nounced increases in risks of obesity than white cohorts, controlling for the 
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TABLE 7.6 
The Random Coefficient CCREM Estimates of Obesity: NHANES 1971-2008 
Fixed Effects Parameter Coefficient se tRatio p Value 
Intercept Yo -0.57 0.15 -3.79 0.004 
AGE JA 0.01 0.00 8.46 «0.001 
AGE Y. —0.00 0.00 -10.81 <0.001 
SEX (male = 1) Ys —0.43 0.06 -7.44 <0.001 
RACE (black = 1) Ya 0.45 0.07 6.55 0.001 
EDUCATION ys 
SOME COLLEGE (13-15 years) —0.08 0.03 —2.38 0.017 
COLLEGE (> 16 years) —0.40 0.04 —9.80 <0.001 
FAMILY INCOME Ye 
LOWEST QUARTILE 0.13 0.03 4.44 «0.001 
HIGHEST QUARTILE -0.14 0.03 4.26 <0.001 
Variance Components Variance se z Statistic p Value 
COHORT 
Intercept To 0.00 0.00 0.78 0.217 
SEX T3 0.01 0.01 1.41 0.079 
RACE T4 0.03 0.02 1.68 0.047 
PERIOD 
Intercept Tyo 0.19 0.10 1.97 0.024 
SEX Tys 0.02 0.01 1.39 0.082 
RACE T 0.02 0.01 1.44 0.075 
Model Fit 
-2 Res Log Pseudo-Likelihood 188268.0 | df-40180 


age and period effects and other factors. Consistent with the earlier study by 
Reither, Hauser, and Yang (2009), both black females and males showed par- 
ticularly large increases for cohorts born after 1955-60. And black females 
were the most likely to be obese on average among all groups examined, with 
the peak cohort of 1970-74 having a 52% predicted probability of obesity. The 
black and white gaps increased for both sexes across cohorts, contributing to 
the significant random variance of the race coefficient in Table 7.6. 

In sum, the findings corroborated those from prior studies using other 
data sources and analytic methods. We have shown that the obesity epidemic 
in the overall population is a period-driven phenomenon. There are, how- 
ever, substantial social demographic variations in cohort trends of obesity. 
Increasing racial differentials in cohort trends of obesity are particularly 
worthy of attention as the surge in the obesity risks in most recent cohorts 
was restricted to blacks. 
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FIGURE 7.3 
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Appendix 7.1: Matrix Algebra Representations of Linear 
Mixed Models and Generalized Linear Mixed Models 


A general linear mixed (fixed and random) effects statistical model (LMM) 
may be written as 


y = XB+ Zo +€ (7.31) 


where y is a vector of observations, X is a matrix of observations on known 
covariates (regressors), B is a vector of unknown regression coefficients to be 
estimated, Z is a known design matrix, & is a vector of random effects, and 
£ is a vector of errors. Both o and e are unobservable random variables. The 
assumptions of this model are that the random effects and errors have mean 
zero and finite variances with covariance matrices G = Var (o) and R = Var (ei 
that usually involve some unknown dispersion parameters or variance com- 
ponents to be estimated from the data. Note that this is a vector-matrix version 
of the "combined model" form of the HAPC models described in this chapter. 

In most applications of LMMs, the oo and e’s are assumed to be inde- 
pendent. Equivalently, net of the effects of the observed covariates and the 
random effects, the observations in the y vector are assumed to be indepen- 
dent. Then, the covariance matrices reduce to scalars. Specifically, under this 
specification, the random effects 0, ..., Ga are assumed to be independently 
and identically distributed (i.i.d.) with mean 0 and variance o, and the errors 
are i.i.d. with mean 0 and variance ?. And, the usual assumption of LMMs is 
that the random effects and errors also are Gaussian/normally distributed, 
that is, & ~ N(0, o?) and £ ~ NI, 1?). This specification defines the class of 
Gaussian LMMs, and, while other distributional specifications on the ran- 
dom effects (such as t distributions) have been studied in statistics, it is the 
form of LMMs typically programmed in LMM software and therefore most 
widely applied empirically. 

To give a corresponding general representation of the class of Gaussian 
LMMs, we first give an alternative expression of the Gaussian LMMs just 
defined (Jiang 2007: 121). Given a column vector of random effects o, the 
Gaussian LMM assumes the observations y, ..., y, are (conditionally) inde- 
pendent such that y; ~ N(x;D + z;o, 12), where the primes denote the matrix/ 
vector transposition operator, x; and z; are known column vectors for the ith 
observation, B is an unknown vector of regression coefficients, and ? is an 
unknown variance. Suppose also that & is multivariate normal with mean 
0 and covariance matrix G, which depends on a vector 0 of unknown vari- 
ance components. Let X and Z be the matrices whose ith rows are x; and z;, 
respectively. It is easy to see that this leads to the LMM of (731) with normal- 
ity and R = “I. 

In the specification of the Gaussian LMM, the two key assumptions are 
(1) conditional independence of the observations (given the random effects) 
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and (2) the distribution of the random effects. These two elements can be 
used to define the class of GLMMs. Given a column vector of random effects 
a, suppose the responses y, ..., y, are (conditionally) independent with con- 
ditional distribution a member of the exponential family of distributions 
with probability distribution function: 


fil oe exp [18 aa (732) 


where b(), oC, c(,) are known functions, and @ is a dispersion parameter 
that may or may not be known. The quantity e, is associated with the condi- 
tional mean u, = E(y;|o), where E is the expectation operator. This conditional 
mean in turn is associated with the linear predictor: 


n; = xíB+ zia (733) 


where x; and z; are known column vectors, and B is a vector of unknown 
regression coefficients (i.e., the fixed effects), through a known link function 
ot) such that 


g(u;) =ù; (7.34) 


As in the LMM class of models, it is assumed that œ ~ N(0, G), where the 
covariance matrix may depend on a vector 0 of unknown variance compo- 
nents. In brief, the GLMMs retain the linear predictor function of the LMMs 
but generalize the family of frequency distributions for the outcome obser- 
vations in such a way they accommodate nonnormal and discrete vari- 
ables. The importance of the GLMM class of models is that members of the 
exponential family of distributions it incorporates include the Gaussian, 
binomial, Poisson, gamma, and inverse-Gaussian distributions. This facili- 
tates application, for example, of the logistic and Poisson link functions, as 
illustrated in Sections 75.2 and 7.6 and Sections 8.2, 8.3, and 8.5 in Chapter 8. 

In the empirical applications of LMMs and GLMMs to the HAPC mod- 
els and analyses described in this chapter, the elements of the y vectors are 
the outcome observations for which an analysis of age, period, and cohort 
effects is sought (e.g., the GSS WORDSUM scores), the elements of the 
X matrices and corresponding x vectors are the individual-level explanatory 
variables for which observations are available in the repeated cross-sectional 
sample surveys and for which corresponding fixed effects coefficients are 
specified in the B vectors, and the elements of the Z matrices and correspond- 
ing z vectors are the specifications of the time periods and cohorts to which 
specific observations belong and for which corresponding random effects 
are specified in the o vectors. 

As noted, the conventional assumption in specifications of LMMs and 
GLMMs is that the random effects and errors are normally distributed, and as 
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noted in the text of this chapter, the estimation algorithm typically used in 
software programs for these models is an ML or REML-EB. Research in sta- 
tistics has shown that, even if the normality assumptions do not hold, the ML 
and REML estimators are consistent and asymptotically normal for point 
estimation of the fixed effects coefficients (in non-Gaussian LMMs, the use 
of normality-based estimators is termed quasi-likelihood; Jiang 2007: 16-17). In 
addition, however, when normality does not hold, the asymptotic covariance 
matrix involves parameters other than the variance components, namely, the 
third and fourth moments of the random effects and errors. This implies 
that standard errors may be incorrectly estimated. Accordingly, in empirical 
applications of LMMs and GLMMs, analysts should assess the plausibility of 
the normal distributional assumptions. If the data are substantially nonnor- 
mal and the number of random effects is relatively small, caution is advised 
regarding inferences about statistical significance as conventional software 
programs often do not include functions of the third and fourth moments in 
their algorithms, and standard errors may be underestimated. 
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Mixed Effects Models: Hierarchical APC- 
Cross-Classified Random Effects Models 
(HAPC-CCREM,, Part II: Advanced Analyses 


8.1 Introduction 


In this chapter, we illustrate applications of the family of hierarchical age- 
period-cohort (HAPC) models for more advanced analysis of repeated cross- 
sectional data designs in the forms of both sample surveys and aggregate 
rates. In addition to a new example using repeated cross-sectional surveys 
from the General Social Survey (GSS) on happiness introduced in Chapter 3, 
we continue to analyze the cancer mortality rate data to relate their temporal 
trends revealed in Chapter 6 to potential risk factor mechanisms. We also 
describe extensions to HAPC models useful for solving substantive prob- 
lems in empirical research. These extensions involve more advanced statis- 
tical methods, such as Bayesian methods and heteroscedastic regressions 
(HRs), and can be skipped by readers with less-advanced statistical method- 
ological knowledge. 


8.2 Level 2 Covariates: Age and Temporal Changes 
in Social Inequalities in Happiness 


If there is evidence from the basic HAPC-CCREM (cross-classified random 
effects model) analysis for clustering effects of random errors or significant 
random variability across birth cohorts, how can it be explained or what may 
account for the variance? The same questions may apply to the investigation of 
period effects. Such problems suggest the importance of explanatory factors 
related to birth cohort and period effects and cannot be fully accommodated 
in the classical APC (age-period-cohort) accounting framework introduced in 
Chapters 5 and 6 (Smith 2004). The HAPC modeling approach, therefore, is 
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an improvement that offers an option for researchers interested in identify- 
ing key explanatory factors in addition to age, period, and cohort indicators. 
The specification of random coefficient HAPC-CCREMs with period-level 
and cohort-level variables to explain the period and cohort effects we have 
estimated in Chapter 7 is a next step in model building and assessment. 

We now illustrate the use of the more elaborate models that combine the 
specifications of random coefficients and level 2 covariates through an analy- 
sis of trends in social inequalities of happiness." Based on the time series data 
on happiness in the United States for people 18 years of age and older from 
the GSS 1972-2004, we conducted an HAPC analysis to address the follow- 
ing questions about changes in the social inequalities of quality of life: Do 
people get happier with age and over time? Are there any birth cohort dif- 
ferences in happiness? How do social inequalities—sex, race, and education 
differentials—in happiness vary over the life course and by time? And, what 
could have contributed to these differences and changes? 

The HAPC-CCREM applied to the happiness data was specified as the 
following: 


Level 1 or ^within-cell" model: 


Pr( HAPPY nix = 1) = Bo jx F B, AGE s B.AGE° + Bs SEX + Ba RACE 


P (8.1) 
+BsEDUC+ A BA, 
p=6 


It states that the cumulative logit of happiness score that has m = 3 ordinal 
response categories (1 = very happy; 2 = pretty happy; 3 = not too happy) 
for respondent i (for i = 1, 2, ..., n) within period j (for j = 1, 2, ..., 22) and 
cohort k (for k = 1, 2, ..., 18 five-year cohorts) is a function of age, age squared, 
sex, race, education, and other correlates X, where X denotes the vector of 
other individual-level variables, including age by sex, age by race, age by 
education interaction terms for the test of hypothesis of differential social 
inequalities over the life course, and control variables (income, marital sta- 
tus, health, employment status, family structure, and religious attendance) 
that are known to be strongly associated with happiness (Blanchflower and 
Oswald 2004; Davis 1984; Easterlin 2001; Hughes and Thomas 1998). In this 
model, Box is the intercept indicating the cell mean for the reference group 
at mean age surveyed in year j and belonging to cohort k; Bu — Bs, and B, 
denote the level 1 coefficients; and P is the maximum number of covariates 
included. Age divided by 10 and all other continuous covariates were grand 
mean centered for ease of interpretation of the intercept values. 


* Parts of Section 8.2 were adapted and updated from Yang, Y. 2008. American Sociological 
Review 73:204—226. 
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Level 2 or “between-cell” model: 


Overall mean: 


Boz = Yo + Uoj + Vok (8.2) 
Sex effect: 
Pajk = Ya + Vsk (8.3) 
Race effect: 
Paj = Ya + Vak (8.4) 


Equation (8.2) is the model for the random intercept that specifies that the 
overall mean varies from period to period and from cohort to cohort, where 
Yo is the expected logit of being very happy at the zero values of all level 1 
variables averaged over all periods and cohorts; uç; is the cohort effect in 
terms of residual random coefficients of cohort j averaged over all time peri- 
ods with variance t,y Vo is the period effect in terms of residual random 
coefficients of period k averaged over all birth cohorts with variance La We 
can constrain the age effects to be fixed just as in the previous analysis of 
verbal test score. The next two equations specify the sex and race coefficients 
to have random period effects represented by residuals v4, and Ou, whose 
random variance components are denoted as T,3 and 1,4, respectively. We also 
tested for random cohort effects for the sex and race coefficients but did not 
find any. Other level 1 covariates are modeled as fixed across level 2 units 
because supplementary analysis did not suggest they have significant ran- 
dom effects. Based on the combined models of Equations (8.1)- (8.4), we next 
estimated ordinal logit CCREMs of happiness using SAS PROC GLIMMIX 
(Littell et al. 2006) (see the sample codes). Probabilities modeled are cumu- 
lated over the lower ordered values based on the assumption of proportional 
odds, and the coefficients indicate the effects for the probability of being very 
happy (1) versus others (2 and 3) and the probability of being very happy or 
pretty happy (1 or 2) versus not too happy (3). 

Table 8.1 presents estimates of fixed effects coefficients in the form of odds 
ratios of being happy and random effects variance components from the 
ordinal logit CCREMs. Model A includes the key social status variables, and 
model B adjusts for control variables. Model A shows a significant age effect 
net of the random period and cohort effects. It suggests that, adjusting for 
time period and birth cohort variations, the odds of being happy increased 
9.8% with every 10 years of age. Adjusting for other covariates in model B, the 
age effect became significantly quadratic, indicating an acceleration of the age 
increase at the rate of 2.7% every decade over the life course. Thus, the first 
important finding is that with age comes happiness. It is important to note 
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that in the HAPC models of happiness where all three temporal factors 
are considered, the age effects dominate, and the period and cohort varia- 
tions are small. This suggests that studies of temporal changes in subjective 
well-being that ignore life course changes may be misleading in giving the 
impression that time trends and cohort differences of happiness are more 
substantial than they actually are. 

Results of other fixed effects in model À show that women, whites, and col- 
lege graduates on average had higher odds of being happy relative to men, 
blacks, and people with less education, respectively. Whereas the sex effect 
was small and only marginally significant, the race and education effects 
were substantial and highly significant. Being black was associated with 47% 
lower odds of being happy. Having college education increased the odds of 
being happy by about 30%, whereas having less than a high school degree 
decreased the odds by 30%. Comparing models A and B showed that the 
salient sex and race differentials in happiness persisted even when major 
correlates of happiness were held constant. But, the race effect decreased in 
size, and the effects of educational attainment were no longer significant. 
This means that some of the effect of being black and all of the effect of edu- 
cation were mediated by other covariates. These results were largely consis- 
tent with previous cross-sectional studies of social correlates of happiness at 
a point in time. But, they bear different interpretations in the HAPC analysis. 
That is, they represent individual-level fixed effects when level 2 heterogene- 
ity by period and cohort are taken into account. 

The interaction terms at level 1 in model A suggest that the sex, race, and 
educational gaps in happiness varied significantly with age. All the inter- 
action terms remained significant when all things were considered in 
model B, so the life course trajectories of happiness still showed appreciably 
different trends. When potential explanatory factors were controlled, how- 
ever, the interaction effects of sex, race, and education with age shrank in 
size. Figure 8.1a shows the predicted probabilities of being very happy by 
age from model B for various social groups at the zero values of all other 
covariates (for the reference group). The age curves of happiness are con- 
vex and J shaped, suggesting that the average probability of being happy 
bottomed out in early adulthood and increased at an increasing rate as 
one moved through the life course. The life course patterns of happiness 
were different, however, for men and women, blacks and whites, and the 
more educated and the less educated. For both blacks and whites, women 
were happier than men, but the gaps showed convergences with age. The 
adjustment for marital status, health, and work status decreased the age-by- 
sex interaction effect. This supports the hypothesis that the changes of the sex 
effect on happiness with age are partly due to increases in widowhood and 
worsening health among women, which decrease happiness, and retirement 
and better health among men, which increase happiness. The racial gap in 
happiness was larger than the sex gap for a large segment of the life course. 
Whites’ advantage over blacks starts large and gradually decreases with age. 
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FIGURE 8.1 
Predicted age variations in sex, race, and education effects on happiness from model B: 
(a) Age*Sex and Age*Race effects; (b) Age*Education effects. 


When other factors were held constant, the race differentials showed rever- 
sals occurring at around the age of 70, after which blacks became slightly 
happier than whites. The age and education interaction effects are graphi- 
cally presented in Figure 8.1b. There appear to be crossovers in age effects of 
happiness among the three educational groups at the age of 50, after which 
the highly educated were not particularly better off than the less educated. 
These convergences indicating loss of advantages for women, whites, and the 
highly educated with age do not seem to be completely explained by marital 
status, health, or other factors. The persistent social disparities in life course 
patterns of happiness invite future investigations of aging-related theories. 
The lower panel of Table 8.1 reports the results of random effects in terms 
of residual variance components for the intercept and level 1 coefficients of 
sex and race. There seemed to be significant period changes in levels of 
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happiness and sex and race inequalities net of age effects, cohort changes, 
and other factors. There were also significant cohort changes in levels of hap- 
piness, although none of the level 1 effect coefficients showed any significant 
random effects by birth cohort. That is, the advantages of women and the 
highly educated and the disadvantages of blacks and the less educated were 
constant across successive cohorts. 

Figure 8.2 displays the estimated random period effects from model A, 
namely, the predicted probabilities of being very happy for each year at the 
mean age and averaged over all birth cohorts by sex and race. The calcula- 
tions of these probabilities followed the same procedure described for the 
National Health and Nutrition Examination Survey (NHANES) obesity 
analysis in Section 77 in Chapter 7. The overall period effects showed a gen- 
eral weak downward trend across the first two decades that was consistent 
with findings from previous studies based on GSS data but also exhibited 
clear nonlinear declines over time. The range of such period changes was 
relatively small, that is, largely within 5% over the 30 years. The sex-specific 
and race-specific results suggest that both sex and race disparities decreased 
during the last 30 years in the United States. Figure 82a illustrates that the 
reduction in the sex gap was due to decreasing levels of happiness in women 
and the stable trend in men as of 1995 and similar increases for both after- 
ward. The sex differentials were relatively small even at the initial time 
period, and they were not statistically significant in the final model when 
other factors were taken into account. Figure 82b illustrates that the black 
and white gap was much more pronounced throughout the 30-year period, 
but there were small declines in more recent years. From the 1970s to the 
mid-1990s, whites experienced a slight downward trend in happiness, and 
blacks experienced some fluctuations early on and a stable trend after 1980. 
Both groups fared better after 1995. Such period changes in racial differences 
remained statistically significant in model B after adjustment of other factors. 

Figure 8.3 shows the random cohort effects in terms of the predicted prob- 
abilities of being very happy at the mean age and averaged over all periods 
estimated from model A for the reference group. As is the case for period 
effects, the cohort effects were small relative to the age effects and showed no 
linear increases or declines. There was barely any evidence in prior studies 
for cohort differences in happiness, although there were speculations that 
more recent cohorts had experienced lower levels of happiness. The results 
do not strongly support such a hypothesis when confounding effects are 
controlled. The cohort trends do not show the monotonic declines that previ- 
ous research had speculated might be engendered by cohorts' value shifts 
under the influence of "postmaterialism." Instead, they were flat for the ear- 
lier and later cohorts. The most intriguing finding is the dip in the predicted 
probabilities of being very happy for cohorts born between 1945 and 1960, 
that is, the baby boomers. 
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FIGURE 8.2 
Predicted period variations in the sex and race disparities in happiness: (a) changing sex dif- 
ferences by period; (b) changing race differences by period. 


The HAPC analysis of happiness suggests that life course patterns, time 
trends, and birth cohort differences in happiness are distinct and independent 
of each other. The significant net age, period, and cohort effects suggest that 
itis important to formally test variations in all three time-related dimensions 
in studies of changes in subjective well-being for adequate interpretation and 
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Predicted cohort variations in happiness from model A. 


valid inference with regard to these effects. The individual-level covariates 
included have accounted for some of the age-related changes in happiness 
and social differentials in age changes. It is not clear what could have con- 
tributed to the period and cohort patterns. Some hypothesized that the for- 
mer could be correlated with changes in macroeconomic variables such as 
gross domestic product from 1970s to 1990s and levels of joblessness (Di Tella, 
MacCulloch, and Oswald 2003). So, does gross domestic product (GDP) buy 
extra happiness? And, how does unemployment rate feed through into people's 
perceptions of quality of life? We can examine these hypotheses through the 
specification of level 2 covariates in the HAPC-CCREM framework. 

The plots of variables associated with economic cycle in Figure 8.4 sug- 
gest that percentage annual change in the GDP per capita and the unem- 
ployment rate seemed to accompany the period changes in happiness. So, 
we can add the two period-level covariates, AGDP, and unemployment rate 
(UNEMPLOY,), to the models for the overall mean, sex, and race effects to 
test whether economic environment explains period variations in happiness 
and sex and race gaps therein. In addition, the cohort analysis proposition 
emphasized the exogenous social demographic environment into which 
cohorts were born and in which they came of age. The baby boomer pattern 
revealed is consistent with the Easterlin hypothesis that large cohort sizes 
have negative consequences on cohort members’ socioeconomic and other 
life course outcomes, such as sense of well-being and mental stress (Easterlin 
1987). Figure 8.5 shows how closely the dip in predicted probabilities of hap- 
piness mirrored the rise in the relative cohort size (RCS) that defines the baby 
boomer cohorts. So, we can also add this cohort characteristic to the model 
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FIGURE 8.5 
Birth and fortune: cohort effects of happiness and relative cohort size (RCS). 


for the overall mean to account for cohort-level variance. The level 2 models 
can now be expanded to the following: 


Level 2 or “between-cell” model with level 2 covariates: 


Overall mean: 


Boje = Yo + Yn AGDP, + Yo.UNEMPLOY; + Yo RCS; + uo; + Vox (8.5) 
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Sex effect: 


Bs — Ys + Y3AGDB, F Y3U NEMPLOY,; + Us (8.6) 


Race effect: 


Bar = Y4 + YaAGDB, + Y 9UUNEMPLOY, + v4; (8.7) 


The model estimates for level 2 covariates added to model B in Table 8.1 
are summarized in Table 8.2. The results on other variables were similar 
to those shown in Table 8.1 and thus are omitted in the interest of space. 
The fixed effect coefficients showed that economic prosperity as indicated 
by percentage increases in GDP per capita during this period was signifi- 
cantly associated with more happiness on average, whereas larger cohort 
size reduced happiness significantly. Economic conditions also affected sex 


TABLE 8.2 
CCREM of Happiness: Period- and Cohort-Level Covariates 
Fixed Effects Parameter Coefficient Odds Ratio p Value 
Model for Mean 
Intercept Yo —0.95 0.39 «0.001 
AGDP Yo 0.02 1.02 «0.001 
UNEMP Yoo 0.00 1.00 0.858 
RCS Yos -0.03 0.97 0.019 
Model for Sex Effect 
Intercept Ké 0.17 1.18 <0.001 
AGDP Y -0.02 0.98 «0.001 
UNEMP Ya -0.03 0.97 0.159 
Model for Race Effect 
Intercept Vs —0.40 0.67 «0.001 
AGDP Ya 0.03 1.03 0.029 
UNEMP Yo 0.08 1.08 0.099 
Variance Component Variance s.e. % Reductiona 
COHORT Za 
Intercept 0.00 0.00 73 
PERIOD 
Intercept Too 0.01 0.00 — 
SEX Tos 0.00 0.00 83 
RACE m 0.02 0.02 38 
Model Fit 
-2 Res Log-Pseudo-Likelihood 231798.9 df = 28,833 


a Compared to the variance estimates from Model B of Table 8.1. 
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and race differences over time. Increases in GDP reduced both sex and race 
differences (the race effect was negative, so a positive coefficient of GDP led 
to a decrease in the black-white gap). A higher unemployment rate, inter- 
estingly, was associated with decreases in the race gap. Accounting for the 
period-level covariates largely reduced the period variances in the sex (83%) 
and race (38%) effects relative to those estimated in model B of Table 8.1. 
Since the residual period effect in the intercept or overall mean remained 
the same, there were other period factors for future research to consider. 
And, accounting for the single cohort characteristic reduces the cohort vari- 
ance by 73%. Therefore, fortunes do seem to be more closely related to early 
life conditions and formative experiences. Larger cohort sizes increased the 
competition to enter schools and the labor market and created more strains 
to achieve expected economic success and family life. The unique experi- 
ences of these cohorts during early adulthood thus can have a lasting impact 
on their sense of happiness. 


8.5 HAPC-CCREM Analysis of Aggregate Rate Data 
on Cancer Incidence and Mortality 


The examples introduced so far are from sample surveys of individual 
respondents nested within time periods and birth cohorts. We mentioned 
in Chapter 3 that the same repeated cross-sectional design of aggregate 
rate data can also become multilevel given the existence of finer-grained 
age-specific data within each period and cohort group. We now turn to the 
HAPC-CCREM analysis of the cancer incidence and mortality data on which 
the Intrinsic Estimator (IE) analysis was based in Chapter 5. We address 
three areas that have not been addressed before: model validation, social 
disparities in cancer trends, and period- and cohort-related mechanisms. 


8.3.1 Trends in Age, Period, and Cohort Variations: 
Comparison with the IE Analysis 


Are the IE estimates of age patterns and temporal trends of cancer inci- 
dence and mortality rates consistent with those obtained from the applica- 
tion of the HAPC-CCREM? Consistency of results across different models 
provides evidence for model validation in addition to the statistical means 
through simulations. We have already shown strong support for the valid- 
ity of the IE method through the analysis of the microdata from the GSS on 
verbal ability in Chapter 5. We now broaden the evidentiary base by apply- 
ing the HAPC models to aggregate rate data from different cancer sites 
and outcomes. We estimated Poisson CCREM using SAS PROC GLIMMIX. 
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Tables 8.3 to 8.6 present the CCREM estimates of fixed effects of age (mean 
centered) and random variance components of cohort and period effects 
for lung, colorectal, breast, and prostate cancer incidence and mortality 
rates, respectively. Results from model A show highly significant and non- 
linear age effects for all outcomes examined. We plot the predicted age- 
specific rates of incidence and mortality in Figure 8.6 (with cohort and 
period covariates in Figure 8.7) and Figures 88-810. The specific shapes 
of the age curves correspond to the parametric forms we imposed on the 
age effects and may not be directly compared to those of the IE results. 
While we also fit the models using dummy indicators of age groups, we 
chose the current functional forms of age effects based on parsimony and 
test of significance. 

The cohort effects indicated by random cohort variances for the intercept 
are highly significant for both incidence and mortality of all four cancers. 
The plots of predicted rates by cohort using the random cohort coefficients 
also document similar cohort trends as those shown in the IE analysis. For 
example, lung cancer incidence and mortality results showed first increas- 
ing and then decreasing cohort effects for all groups, with delayed peaks in 
female cohorts. Cohort declines in mortality are observed for all other cancer 
sites, and those in incidence apply to most but not all cohorts. The period 
effects indicated by random period variances for the intercept were much 
smaller and less significant than the cohort effects for all cancer sites and 
outcomes except for prostate cancer incidence. This is also evident from the 
inspection of the predicted rates by period. As shown in the IE analysis, the 
period effects were relatively flat and showed slight variations that differed 
in trends by subgroups. 

In all, we find highly compatible estimates of A, P, and C effects from the 
IE and HAPC analyses of incidence and mortality related to major cancer 
sites in the United States. While the former estimation method applies to 
the conventional linear model that uses a special constraint to identify the 
model, the latter method does not incur the identification problem and uses 
no constraint. The convergence in the results from the applications of these 
two different methods thus validates our findings on the cancer trends that 
serve as the foundation for further investigations of social differentials and 
explanatory factors. 


8.3.2 Sex and Race Differentials 


The IE method is for the APC accounting models that do not accommodate 
additional covariates. Therefore, the previous analysis of cancer trends was 
conducted on stratified groups of population by sex and race. Although the 
findings showed some qualitative differences in the A, P, and C effects for 
men and women, blacks and whites, they do not constitute quantitative evi- 
dence for real group differences due to the lack of a statistical test of the 
sex and race effects. We now employ the HAPC models to test the group 
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FIGURE 8.6 
Age, period, and cohort variations in lung cancer incidence and mortality rates by sex and race. 


differences or the hypothesis of social disparities in aging and cancer links 
and time period and birth cohort variations in cancer. 

We entered sex and race variables to the level 1 model and tested both their 
main effects (the intercepts), their interaction effects with age, and the inter- 
action between the two themselves. At level 2, we specify random coefficient 
models to test the cohort and period variations in the sex and race effect 
coefficients. The fixed effect estimates in model A showed substantial sex 
differences in lung and colorectal cancer incidence and mortality rates that 
indicated significant male excesses. There were also pronounced race differ- 
ences in cancer incidence and mortality rates that indicated black excesses 
in all but one case. Black females showed significantly lower relative risks 
of breast cancer incidence, which is consistent with epidemiologic studies of 
racial disparities in breast cancer (Chlebowski et al. 2005). The age coeffi- 
cients in the models for sex and race effects represent the interaction terms 
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FIGURE 8.7 
Cohort and period covariates: smoking, obesity, HRT, and mammography. 


between age and sex and between age and race, and they suggest that the 
sex and race gaps in cancer incidence and mortality varied significantly with 
age. Figures 8.6 and 8.8-8.10, accordingly, show different age trajectories of 
cancer outcomes for men and women in the two race categories. Both lung 
cancer and colorectal cancer incidence and mortality rates showed diverg- 
ing sex gaps with age. For the case of lung cancer, race gaps grew with age 
in women and remained largely constant in men. For the case of colorectal 
cancer, race gaps decreased with age in both sexes. The results for breast and 
prostate cancer outcomes were similar regarding age changes in race differ- 
ences. The black and white gaps in the incidence rates increased until the 
age of 75 and slightly decreased in oldest ages, whereas the black excesses in 
mortality rates continued to grow throughout the life course. 

Sex and race disparities also showed cohort and period variations for all 
cancer sites examined. The random variance components in Tables 8.3-8.6 for 
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Age, period, and cohort variations in colorectal cancer incidence and mortality rates by sex 
and race. 


the sex and race effects are sizable and significant in most cases, particularly 
for cohort variance components. Model A of lung cancer incidence and mor- 
tality, for example, showed large cohort variances in the sex effect that were 
highly significant. The cohort trends in lung cancer incidence and mortality 
rates in Figure 8.6 further show the male and female differences across cohorts 
that reflect the sex-specific cohort patterns of cigarette smoking. Period vari- 
ances in the sex differences were also pronounced for lung cancer incidence 
rates, which showed decreases over time for males but increases over time 
for females. Racial disparities across cohorts and periods were much larger 
for males than females. The results for colorectal cancer outcomes in Table 8.4 
showed larger and more significant cohort than period differences in the sex 
and race gaps. There is evidence in Figure 8.8 for decreasing racial gaps in 
most recent cohorts in both incidence and mortality rates. 
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Age, period, and cohort variations in breast cancer incidence and mortality rates by race. 


Tables 8.5 and 8.6 suggest that race differences in breast and prostate cancer 
incidence rates varied significantly by cohort and period, while those in mor- 
tality rates varied only by cohort. Figure 89 shows that the white excesses in 
incidence of breast cancer decreased in cohorts born after 1960 and the last 
10 years. The black disadvantage in breast cancer survival increased signifi- 
cantly across cohorts due to the increases in black female mortality rates and 
decreases in white female mortality rates for cohorts born between 1910 and 
1950. While both black and white female cohorts born after 1950 experienced 
declines, the racial gaps remained substantial. Figure 8.10 shows increases in 
the racial gaps in prostate cancer incidence rates in the most recent cohorts 
and years. It also shows much larger black disadvantages in survival than 
incidence rates. While these disadvantages decreased across cohorts, they 
remained constant over time, as indicated by the lack of significance in the 
period variance component for the race effect. 
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Age, period, and cohort variations in prostate cancer incidence and mortality rates by race. 


8.3.3 Cohort and Period Mechanisms: Cigarette Smoking, Obesity, 
Hormone Replacement Therapy, and Mammography 


Analyses using the classical APC accounting models of aggregate rates only 
concern temporal trends that may be attributed to cohort- or period-related 
changes. While a large body of demographic, aging, and epidemiologic lit- 
erature has developed theories about how cohort effects arise, as we summa- 
rized in Chapter 2, the test of cohort-based mechanisms cannot be conducted 
within the framework of the linear models. Similarly, explanations of period 
effects are usually drawn from correspondences of changes in rates and 
social demographic or epidemiological conditions that are expected to influ- 
ence these rates. The question of how hypothesized period- or cohort-related 
factors actually explain period or cohort effects is beyond the scope of the 
simple accounting models (Smith 2004). Instead, it requires more fine-grained 
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regression analyses using a different kind of model that can accommodate 
period- and cohort-level covariates. We now include several such covariates 
(cigarette smoking, obesity, hormone replacement therapy [HRT], mammog- 
raphy) that have been established in cancer epidemiologic research as impor- 
tant risk factors for the four cancer sites under investigation. 

The CCREM analysis of lung cancer incidence and mortality in Table 8.3, 
model A, suggests significant cohort effects on the intercept, sex, and race 
coefficients. We entered one cohort-level covariate of smoking (as defined 
in Chapter 3) in model B to test how differential cohort exposures to smok- 
ing can account for the cohort effects on the overall incidence and mortality 
rates and also cohort variations in the sex and race gaps in these rates. The 
estimated fixed effects for the cohort smoking variable and its interactions 
with race effects were highly significant in the models of both incidence and 
mortality. The cohort smoking and sex interaction effect was not significant 
in the mortality model. Because only the linear terms of level 2 covariates 
were included in this and subsequent analyses, their fixed effects may not 
be significant if the underlying correlations are nonlinear. Comparisons of 
sex-specific cohort lung cancer mortality in Figure 8.6 and cohort smoking 
exposures in Figure 8.7a suggest that they closely resemble each other in 
nonlinear changes—increases first followed by decreases. The inclusion of 
the cohort smoking variable largely reduced the cohort variances for the 
intercept (by 9870) and sex coefficient (by 95%) for the incidence model. 

We tested the three variables of period smoking prevalence (current smok- 
ers, former smokers, and smoking cessation) and found largely similar results 
for the first two and less-strong effects for the third. Therefore, we presented 
the estimates associated with the prevalence of current smokers in model B 
for this and subsequent analyses when smoking is considered a major carci- 
nogenic factor. The results from model A suggest significant period effects 
for the intercept and sex coefficient in the incidence model. Figure 8.7b docu- 
ments downward trends of smoking prevalence rates during this period for 
males, less-steep declines for white females, and increases in black females 
before the 1990s. Model B adjusted for period smoking prevalence and 
showed it was significantly related to lung cancer incidence and sex differ- 
ences therein, net of all the other factors. Figure 8.6 shows the declining male 
incidence rates and slightly increasing female incidence rates, reflecting the 
lagged effects of period changes in smoking for each group. The random 
variance components also showed that it accounts for most of the period 
variations in sex gaps (by 93%). The residual cohort variance for the inter- 
cept of mortality and period variance for the intercept of incidence remained 
significant or increased in size, suggesting the need to search for additional 
cohort-related explanations for the inverse-U-shaped cohort effects on lung 
cancer mortality and period-specific explanations such as diagnostic changes 
for lung cancer incidence patterns during this time. 

In the CCREM analysis of colorectal cancer shown in Table 8.4, we adjusted 
for cohort and period covariates of both smoking and obesity prevalence 
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rates in model B. The fixed effects estimates showed that both cohort smok- 
ing and obesity played significant roles in affecting the overall colorectal 
cancer incidence and mortality rates. The cohort declines in smoking shown 
in Figure 8.7a correspond well to the cohort declines in colorectal cancer inci- 
dence and mortality rates shown in Figure 8.8. Cohort smoking was also sig- 
nificantly related to sex differences in both incidence and mortality and race 
differences in incidence. The increases in incidence in more recent cohorts 
corresponded to the increasing cohort prevalence of obesity documented in 
Figure 8.7c. Cohort obesity was also significantly related to sex differences 
in mortality and race differences in both incidence and mortality. The inclu- 
sion of these two cohort covariates largely reduced the cohort variance com- 
ponents for all random coefficients except for the intercept in the mortality 
model. Period smoking and obesity changes (in Figure 8.7d) significantly 
affected period changes in colorectal cancer incidence rates and race dif- 
ferences in mortality rates. The period random variances were reduced in 
size and significance level after adjusting for period-level covariates in both 
the incidence and mortality models. The only significant residual variance 
is that associated with the intercept for the incidence, which may be further 
explained by period changes in colonoscopy usage. 

The results for breast cancer outcomes are reported in model B in Table 8.5. 
One cohort-level covariate of obesity was entered to both the incidence and 
mortality models and two period-level covariates, HRT and mammography, 
were entered in the incidence model (as they did not show significant asso- 
ciations with mortality even individually considered). Although these level 2 
covariates showed significant fixed effects in bivariate analyses (not shown), 
only period mammography change was significant in main effect when 
all other factors were adjusted. Both period covariates were significantly 
related to race differentials in breast cancer incidence. And, they completely 
explained the period variances in both the intercept and race coefficient as 
they were no longer significant in model B. Based on these results, we con- 
clude that period increases in breast cancer incidence illustrated in Figure 8.9 
were entirely due to increases in the mammography usage and hence detec- 
tions of malignant cases as shown in Figure 8.7e. The downturn in incidence 
rates starting in the year 2000 in white females, on the other hand, was a 
result of sudden reductions of HRT usage, as illustrated in Figure 8.7f, and 
narrowed the racial gaps during this period. Net of all other factors, cohort 
Obesity was significantly related to race differentials in breast cancer mortal- 
ity, and its inclusion reduced the cohort variance of the race coefficient. This 
means that the increases in the black mortality disadvantage for cohorts born 
before 1950 and persistent black excess for cohorts born after 1950 resulted in 
part from higher rates of obesity in these cohorts of black females. 

Table 8.6 presents the final set of analyses on prostate cancer in model B, 
which takes into account cohort and period smoking variables. Because the 
trends in period smoking prevalence (Figure 8.7b) and period effects of inci- 
dence (Figure 8.10) were in the opposite direction, the fixed effect estimate 
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of the period smoking variable is negative. This should not be interpreted 
as evidence for negative or harmful effects of smoking declines on prostate 
cancer incidence. This provides one example that cautions against simple- 
minded causal inferences of the effects of level 2 factors on temporal trends 
in rates and suggests the importance of exploratory data analysis of spe- 
cific factors to be used as explanatory variables. Period smoking prevalence 
explains a substantial proportion of the period variances in incidence rates 
and some period variances in race differentials in incidence rates. Cohort 
smoking patterns contributed largely to cohort declines in prostate cancer 
mortality and race differentials therein. While the cohort variance in the race 
effect was no longer significant after the adjustment of smoking exposure, 
that in the intercept remained significant. 

It is interesting to note that the same cohort smoking variable that has been 
used as a proxy variable to take the place of cohort effects in previous demo- 
graphic studies of mortality (Preston and Wang 2006) did not seem to account 
for all cohort effects of mortality in our analysis. We examined cohort smok- 
ing patterns in relation to cohort effects of mortality from all four cancer 
sites. They did not play any significant role in breast cancer mortality. The 
residual cohort variances remained significant after adjusting for the cohort 
smoking variable in the HAPC models of lung, colorectal, and prostate can- 
cer mortality. Because smoking is by far a more important risk factor for can- 
cer than other causes of death, it is unlikely that cohort patterns of smoking 
can successfully capture all cohort-related variances in mortality. It follows 
that substituting cohort effects with this or other proxy measures is subject 
to model specification errors long suspected by earlier studies (Glenn 2005; 
Smith, Mason, and Fienberg 1982). The HAPC-CCREM analysis has thus pro- 
vided an instrument to test the plausibility of any chosen proxy variables. 

In sum, the HAPC-CCREM analysis moves far beyond simply identify- 
ing age, period, and cohort trends. It allows us to see how social stratifi- 
cation operates over the life course and historical time and to test the extent 
to which aging and cohort theories apply to the outcome of interest in ways 
that cannot be achieved by a conventional linear regression analysis. This 
analytic technique opens new doors to research on time-related change by 
offering a systematic and flexible modeling strategy that takes advantage 
of the multilevel data structure and covariates in repeated cross-sectional 
research designs. The first prominent feature of the HAPC model is that it 
does not incur the identification problem because the age, period, and cohort 
effects are not assumed to be linear and additive. It represents one family 
of nonadditive models that can be extremely useful for capturing the con- 
textual effects of cohort membership and period events on a wide range of 
social demographic processes. The second contribution of the hierarchical 
models approach to APC analysis is that it makes it possible to test explana- 
tory hypotheses by incorporating covariates provided by sample surveys or 


Hierarchical APC-Cross-Classified Random Effects Models, Part II 261 


other sources and yields insights to substantive cohort or period phenomena 
that can hardly be gained from any version of the APC accounting models. 


8.4 Full Bayesian Estimation 


HAPC models can be extended in various ways to address other research 
questions. We introduce two extensions. This section focuses on a full 
Bayesian approach that can be used to refine the estimates of mixed effects 
within the HAPC-CCREM framework. The next section introduces a vari- 
ance function regression (VFR) model that can be used jointly with mixed 
effects models to extend the HAPC-CCREM framework for the understand- 
ing of between-group and within-group heterogeneity. 

Using the example of the GSS verbal scores, we show how to apply methods 
of Bayesian statistical inference to HAPC models of repeated cross-sectional 
survey data. Building on our previous illustrations of the HAPC analysis 
of GSS verbal ability trends, we further examined situations that may affect 
the accuracy of the mixed effects estimates based on the restricted maxi- 
mum likelihood-empirical Bayes (REML-EB) estimation method. In APC 
analyses of finite time period survey data, the numbers of periods and birth 
cohorts usually are too small to satisfy the large-sample criteria required by 
the maximum likelihood (ML) estimation of variance components. In addi- 
tion, the sample sizes within each cohort are highly unbalanced. Therefore, 
errors in variance components estimates may produce extra uncertainty 
in coefficient estimates that will not be reflected in the standard errors. 
This added uncertainty may cast doubt on statistical inferences based on 
REML-EB estimates of model parameters (Raudenbush and Bryk 2002). It 
also motivates the investigation of a full Bayesian alternative to account for 
the extra uncertainty brought about by the small sample sizes and unbal- 
anced data. We show how the HAPC models approach can be implemented 
via the Bayes-MCMC (Monte Carlo Markov chain) methods and how they 
may improve statistical inferences for APC analyses. 


8.4.1 REML-EB Estimation 


Using the same GSS verbal ability data between 1972 and 2000 as in Yang 
(2006), we specify HAPC-CCREM with two additional cohort covariates: pro- 
portion of cohort members who read newspapers daily (NEWS) and mean 
hours of TV watching per day (TV). Both were thought to be related to the 


* Section 8.4 was adapted from Yang, Y. 2006. Sociological Methodology 36:39-74. 
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decline in cohort mean verbal test scores since the early twentieth century 
(Glenn 1992). The new level 2 model (6.32) examines the associations between 
these two cohort characteristics and trends in verbal ability in the presence 
of controls for individual-level factors related to verbal ability in the com- 
bined model (6.33): 


Level 2 or “between-cell” model: 


Boj = Yo +YyıNEWS; + y;TV; + Uoj + Vok, Uoj ad N(0,1,), Vok ~ N(0,T,) (8.8) 


Combined model: 


WORDSUM x = Yo + i NEWS; + y,TV; +B: AGE; +B;AGE2, 
(8.9) 
E B3EDUCATION jx + BuaFEMALE ix + BsBLACK ix + Uoj + Out + E ik 


fori=1,2,..., Nix individuals within cohort j and period k; j = 1, ..., 19 birth 
cohorts; and k = 1, ..., 15 time periods (survey years). 

For any possible value of y, where y is the vector of fixed effect coeffi- 
cients, a likelihood of variance components T, T, and o? can be defined, say 
L(t,,,1,,07|7,Y), where Y is the observed data. Averaging over all possible val- 
ues of y for the likelihood yields a likelihood of, t,, T, and o? given Y alone. 
This is the restricted likelihood, L(t,,1,,07|Y). The REML approach chooses as 
estimates of T,, T, and o?, and those values that maximize the joint likelihood 
of these parameters given Y. Conditioning on these ML estimates, one com- 
putes generalized least squares estimates of y and empirical Bayes estimates 
of the level 1 coefficients B. 

REML-EB estimates obtained using SAS PROC MIXED are presented 
in the left panel of Table 8.7. Cohort effects of newspaper reading and TV 
watching were significant after controlling for individual-level covariates; 
the residual variations between cohorts and between periods are close to 
zero, compared to the unconditional variance estimates (0.139 and 0.031), so 
these level 2 variance components were substantially reduced. 

This approach has several strengths. First, the REML estimates of variance 
components are consistent and efficient in large samples De, for large J and 
K). In addition, for large J and K, the sampling distributions of these estima- 
tors are approximately normal, provided that the REML estimates oft, and z, 
are positive definite. It follows that for large J and K, the normal distribution 
can be conveniently used for constructing confidence intervals and hypothe- 
ses testing. Third, since the EB estimates depend on REML estimates of vari- 
ance components, assuming large J and K, the good large-sample properties 
of the REML estimates give strength to inference about B. 

The limitations of the REML-EB approach are the following: First, the esti- 
mates of z, and t, may be quite inaccurate in small samples. The estimates 
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TABLE 8.7 


Alternative Estimates of Coefficients and Summary Statistics: 
REML vs. Bayes Via Gibbs under Diffuse-Normal-Gamma Priors 


REML Bayes via Gibbs 
Fixed Effects Parameter Coefficient SE Coefficient SE 
Intercept Yo 6.20 0.05 6.20 0.06 
NEWS Yı 1.40 0.47 1.39 0.49 
TV *; —0.26 0.12 —0.26 0.13 
AGE B. 0.02 0.02 0.02 0.02 
AGE p. —0.06 0.01 —0.06 0.01 
EDUCATION Bs 0.38 0.01 0.38 0.01 
SEX (1=female) Ba 0.24 0.03 0.24 0.03 
RACE (1=black) B. —1.05 0.04 —1.05 0.04 
Random Variance 
COHORT Lu 0.02 0.01 0.03 0.01 
PERIOD Tw 0.00 0.00 0.00 0.00 
Individual o? 3.14 0.03 3.14 0.03 
Model Fit 
Deviance 77666.6 77627.9 
DE/pD* 1 30.3 
AIC/DIC® 77688.6 77658.3 


Source: Adapted from Yang (2006: Table 2). 
^ DF for REML and pD as the effective number of parameters for Bayes. 
P AIC for REML and DIC for Bayes. 


of the variances in z, and t, may not have normal distributions for small 
J and K. If the data are unbalanced, estimates of will depend on weights 
that depend on REML estimates so that random variation in these estimates 
will lead to uncertainty about y that will not be reflected in standard errors. 
Furthermore, errors in the REML estimates will result in extra uncertainty 
in the EB estimates of B that will not be reflected in the standard errors. 
Therefore, the confidence intervals for y and B will be shorter, and tests of 
significance will be more liberal than they should be. For a more detailed 
exposition, see Raudenbush and Bryk (2002: Chapter 13). 

In the hierarchical APC analysis of interest here, the number of periods 
and birth cohorts are usually too small to satisfy the large-sample criteria 
required by the REML estimations of variance components. In addition, 
the sample sizes within each higher-level unit De, cohort and period) are 
highly unbalanced, especially for cohorts. There tend to be very few sample 
members for the oldest and youngest birth cohorts, as shown previously in 
Table 3.4. In this particular dataset, J = 19, and n; vary across J, with the small- 
est cohorts having fewer than 100 individuals. In this case, estimates of EB 
coefficients will depend on weights that are functions of the ML estimates 
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of variance components. Therefore, errors in variance components esti- 
mates due to small J and K may also produce extra uncertainty in the EB 
estimates that will not be reflected in the standard errors. Again, tests for 
these point estimates may be too liberal. This problem tends to diminish 
when either n, or n, is large, as in most cohorts and survey years in this exam- 
ple; however, for small J/K and small n; for some cohorts, this dependence of 
EB estimates on REML estimates is undesirable. 

Yang (2006) addressed the question about the adequacy of REML estima- 
tors when J and K are small and data are unbalanced through a number of 
Monte Carlo simulations to examine how the HAPC model would perform 
under the REML assumptions. The Monte Carlo simulations showed that, 
in relatively large samples (large J and K), the REML-EB estimators of the 
HAPC model generally perform well in terms of producing numeric esti- 
mates of the population parameters they are intended to measure. The inter- 
val estimates, however, should be used with caution in both large and small 
samples. It will be useful to have methods for producing more accurate esti- 
mates with small samples of survey years and birth cohorts and unbalanced 
data. This is where Bayesian MCMC estimation fits in. In the following, we 
show a full Bayesian approach to estimation of HAPC model parameters 
that, by definition, ensures that inference about every parameter fully takes 
into account the uncertainty associated with all others. 


8.4.2 Gibbs Sampling and MCMC Estimation 
In a Bayesian formulation of HAPC models applied to the repeated cross 
sections, one combines the likelihood based on the data with prior informa- 
tion about the fixed and random parameters via prior distributions. Based on 
Equations (77), (8.8), and (8.9), the Bayesian model for the verbal ability data 
can be summarized as the following: 

Level 1 model (likelihood): 

f(Y loi 
Level 2 model (stage 1 prior): 
p | Y, Tu, lo) 
Stage 2 prior: 
port, To, G°) = p(y)p(z,)p(z,)p(o°) 


In this formulation, each unknown parameter is viewed as a random 
variable that arises from certain probability distribution. The probability 
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distribution p quantifies the uncertainty about the parameter. Bayes's rule 
then yields the joint posterior distribution: 


p(B, Y, uu T.O |Y) s J | p,o^)p(p | WE To pL Tu, To,0°) * 


Marginal posterior distribution can be derived by integration of the 
Joint posterior with regard to each parameter. Therefore, in the Bayesian 
approach, the inference about every parameter fully accounts for the uncer- 
tainty of other parameters through the use of conditionaland joint probabili- 
ties. Point and interval estimates can then be obtained based on the exact 
posterior distribution. 

Gibbs sampling is used as an accurate computational approach to numeri- 
cal integration. The Gibbs algorithm for the HAPC model is as follows: 


p(B, Y, Tu, 1,07 | Y) = py(B | y, Tur Tor 0°, YY, tu, TO | Y) 
= Py(¥ | Tan 0°, B, Y)r (t, 1,07, B |Y) 
= Pu(tu | to, 07, B, y, Y)r (x, 07, B, Y | Y) 
= p.t, | o^, B, Y, Tu, Y)r (o7, B, v, v, | Y) 


= p (0° |¥,B, Tu, To, Yr (YB, TuTo | Y) 


Gibbs sampling is based on the fact that the posterior density of all 
unknowns can be approximated even though the corresponding densities 
r have unknown forms. Starting with initial values '/9, 1, 1, and o, it 
repeatedly samples from the conditional distributions until stochastic conver- 
gence. The chain of values generated by this sampling procedure is known as 
a Monte Carlo Markov chain (MCMC). An example of the conditional distribu- 
tions needed for Gibbs sampling was provided by Yang (2006: Appendix A). 
On convergence, m iterations are obtained, and the empirical distribution of 
these m values of the unknowns may be regarded as an approximation to the 
true joint posterior. For large m, the marginal posterior may be approximated 
by the empirical distribution of the m values produced by the Gibbs. For this 
study, m = 20,000 for one MCMC chain (see the description that follows). 

The benefits of the Bayesian methods typically come at a price: the required 
choice of prior distributions for model parameters. The topic of the choice of 
priors is a vast one (see, e.g., Draper 2002). A common assumption is that dif- 
fuse priors can be used in an attempt to base information solely on the data 
when no prior knowledge is available for the parameters of interest. This 
means that a prior can be used that allows the data to dominate the determi- 
nation of the posterior distribution. Furthermore, the ever-increasing usage 
of Bayes-MCMC methods in statistical practice has produced several classes 
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of priors that may be viewed as sufficiently “noninformative” in fairly broad 
settings (Guo and Carlin 2004). We implemented Bayesian modeling through 
MCMC methods via the WinBUGS (Bayesian inference using Gibbs sampling 
running under Windows) software package freely available on the Internet. 
Since no previous sociological studies on verbal ability informed the specifi- 
cation of priors, to facilitate comparison of the REML-EB and Bayesian analy- 
ses, the study here selected noninformative proper prior distributions’ with 
hyperparameter values chosen so that the priors exerted minimal impact rel- 
ative to the data (Gelman et al. 2000; Raudenbush and Bryk 2002). Conjugate 
priors! were specified that were comprised of a normal prior for y and an 
inverse gamma prior for the variance components (Gelfand et al. 1990; Carlin 
and Louis 2000): 


Prior: 
p(y) ~ N(0, 1.0E - 6) 
p(1/x,,) ~ Gamma(0.001, 0.001); p(1/t.) ~ Gamma(0.001, 0.001) 


(1/0?) ~ Gamma(0.001, 0.001) 
H 


The priors were chosen so that the Bayesian analysis was similar to a cor- 
responding likelihood analysis (REML-EB). The difference is that likeli- 
hoods were now adjusted and interpreted as probability distributions on 
the parameters. 

Ordinary least squares (OLS) and REML estimates provided the initial 
values for two parallel MCMC sampling chains of 20,000 iterations each, 
following a 5,000-iteration "burn-in" period. The Gelman and Rubin tests 
were used to diagnose effective convergence. A general discussion of MCMC 
convergence monitoring is available in the work of Carlin and Louis (2000). 
Each chain was thinned by 10 to reduce autocorrelation among the param- 
eter estimates. The summary statistics were calculated after the achievement 
of a less-correlated sample. The final sample size for posterior inferences was 
4,000 with 2,000 per chain. The WinBUGS code for fitting the Bayesian hierar- 
chical APC model 3 is provided in the online companion page. Convergence 
diagnostics and output analysis (CODA) and results for parameter estimates 
are available in the work of Yang (2006: Appendices B and C). 


` In general, a prior density p(0) is proper if it integrates to 1. If a prior density is improper, 
the integral of p(0) is infinity and violates the assumption that probabilities sum to 1. 
Although improper priors sometimes can lead to proper posteriors, this is not always the 
case. Posteriors obtained from improper priors must be interpreted with great caution—one 
must always check that the posteriors have finite integrals and sensible forms (Gelman et al. 
2000: 53). 

* Selecting priors from a distribution family that is conjugate to the likelihood leads to a pos- 
terior distribution belonging to the same distributional family as the prior. This can be com- 
putationally convenient. For more details, see the work of Gelman et al. (2000: 37). 
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Residual cohort random variance and period random variance estimates: (a) cohort effect uj; 
(b) period effect oo. 


For comparison, the Bayes MCMC estimates are given side by side with 
the REML results for each model in Table 8.7. Note that SAS PROC MIXED 
delivers solutions for the model parameters only in terms of point estimates 
of the posterior means and associated asymptotic standard error estimates, 
whereas Bayes MCMC methods provide exact posterior inference for each 
node that includes the posterior mean estimate, the MCMC standard devia- 
tion as the estimate for the standard error, Monte Carlo error, median, and 
quantiles. Table 8.7 shows that the fixed coefficients were very similar for 
the REML and MCMC estimates, with the estimated standard errors slightly 
larger in some cases for the MCMC results. The main difference between 
the REML and MCMC estimates was in the level 2 variances. The Bayes 
via Gibbs estimators produced larger estimates of cohort and period vari- 
ances and standard errors. This is due to the Bayes estimates taking into 
account the uncertainty brought about by the small numbers of cohorts and 
periods. For Bayesian model selection, the deviance information criterion 
(DIC) (Spiegelhalter et al. 2002) is a hierarchical modeling generalization of 
Akaike's information criterion (AIC) and is readily available in WinBUGS. 

The cohort and period residual random effects estimates were also com- 
pared for REML-EB and Bayes MCMC results. Figure 8.11 plots the posterior 
means and interval estimates, based on Table 8.7 of the residual cohort effect 
Hor and the residual period effect Gu, respectively. Figure 8.11a shows that for 
both EB and Bayes estimates, the intervals were the widest for the youngest 
and oldest cohorts, which had relatively small sizes (n). Both Figure 8.11a 
and 8.11b show that while the mean estimates were quite close, the Bayes 


` This is an estimate of the difference between the mean of the sample values (which is used as 
the estimate of the posterior mean for each parameter) and the true posterior mean. As a rule 
of thumb, an MC error that is less than about 5% of the sample standard deviation indicates 
convergence of chains. 
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credible intervals (see, e.g., Carlin and Louis 2000 for definitions) were wider 
than the EB intervals of variance components. 


8.4.3 Discussion and Summary 


In applying the Bayesian method to the estimation of HAPC-CCREM, one 
can take further steps to assess how the choice of priors may affect posteri- 
ors. An example of prior sensitivity analysis based on these data was given 
by Yang (2006) that formulated alternative priors to compare the results with 
those based on former priors. It showed that the choice of priors had small 
effects on parameter estimates that depend on small numbers of level 2 units 
(coefficients of newspaper reading and TV watching, cohort and period 
variance components), whereas the effect was negligible on those based on 
the total sample size. This suggests that the full Bayesian HAPC analysis of 
other repeated cross-sectional datasets with similar numbers of cohort and 
periods would also benefit from a prior sensitivity analysis. 

We have shown how Bayesian inferences can be used for cross-classified 
random effects models applied to the vocabulary data from the GSS. We 
compared the empirical results obtained through two parameter estimation 
methods, namely, the REML-EB method and the full Bayes method using 
Gibbs sampling. The point estimates were nearly identical. However, the 
Bayes 95% credible intervals were wider in most cases than were the inter- 
vals based on REML. This increase occurred because the REML intervals did 
not reflect the uncertainty associated with regression coefficients when vari- 
ance components were unknown, and the REML estimates of these variance 
components were not accurate because of the small sample sizes at cohort 
and period levels. The Bayes credible intervals reflected this extra uncer- 
tainty and therefore strengthened the inference. The empirical findings on 
fixed coefficients, variance components, and random effects were generally 
not highly sensitive to the imposition of alternative priors considered in 
the study. In sum, the analytic results shown can be interpreted with more 
confidence than any previous findings of this subject and therefore help to 
resolve the inconsistencies across studies. The substantive findings on age, 
period, and cohort trends and individual-level covariates were the same as 
those shown previously. A new finding is that the decline of vocabulary for 
more recent/younger cohorts born after 1950 was closely associated with the 
decreased percentage of cohort members who read a newspaper daily and 
increased cohort mean hours of TV watching. This cohort effect was not con- 
founded with the aging or period effect. 

The evaluation of the performance of REML and full Bayes estimators in 
the current application was restricted to well-behaved response variables 
in the context of HAPC models of repeated cross-sectional sample surveys. 
Although the results based on the two methods are not drastically conflict- 
ing, the Bayesian estimators generally showed improvement relative to the 
REML estimators. There were other cases where the divergence between 
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the maximum likelihood and Bayes results was larger. The problem of inac- 
curate inference using the REML estimators can be most significant when 
random effects are relatively large, data are sparse and highly unbalanced, 
and response variables are not normally distributed. 

The other advantage of using APC models based on generalized linear 
mixed models is that it can be naturally extended to more sophisticated mul- 
tilevel data analysis. Two improvements can be made in future research. 
First, one can add more levels to the hierarchical model by giving priors 
to the stage 2 hyperparameters that include the fixed coefficients at level 2, 
cohort, period, and individual variance components. The higher-level mod- 
els can further estimate their uncertainties that may arise from probabil- 
ity distributions. Conjugate hyperpriors such as chi-squared distributions 
can be used for computational convenience. Studies can also be found in 
recent biostatistics literature that used autoregressive prior models (Bashir 
and Esteve 2001) and random walk smoothing models (Knorr-Held and 
Rainer 2001) for Bayesian prediction of vital rates. Second, it may be of ana- 
lysts' interest to examine effects of other social units. For example, recent 
reports of the National Assessment of Educational Progress documented 
pronounced regional differences in academic test scores, including read- 
ing scores in the United States (e.g, Weiss et al. 2002). If state-level verbal 
test score data are integrated into sample surveys, it is convenient to move 
beyond the individual-level data using the HAPC models to estimate geo- 
graphic variations in academic performances. 


8.5 HAPC-Variance Function Regression 


In addition to the study of social demographic change through the APC anal- 
ysis, another long-standing core analytic tool of social science is the study of 
inequality through regression models (Blau and Duncan 1966; Morris and 
Western 1999). The use of regression-based models in APC analysis closely 
relates the study of cohort change to the study of inequality both substan- 
tively and methodologically. The analytic tools for the examination of age 
and temporal (ie. period and cohort) variations in inequalities, however, 
have been mostly restricted to those for the conditional means of regression 
models.’ We now embed a variance function regression (VFR) model within 
an HAPC analysis. This facilitates the decomposition of not only between- 
group inequality into age, period, and cohort components De, variations in 
the conditional mean of an outcome across age, period, and cohort) but also 
a similar APC decomposition of within-group inequality Oe, variations in the 


` Parts of Section 8.4 were adapted from Zheng, H., Y. Yang, and K. C. Land. 2011. American 
Sociological Review 76:955—983. 
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conditional variance or dispersion of an outcome across age, period, and 
cohort). More generally, the combined model allows the integration of theo- 
ries of social stratification and social change and opens the door to a better, 
more comprehensive understanding of the dynamics and heterogeneity of 
social processes by which individual lives unfold over the life course and are 
shaped by historical time and social context represented by cohort member- 
ship. We draw on a recent study published by Zheng, Yang, and Land (2011) to 
show how to use the combined model with an example of health disparities. 


8.5.1 Variance Function Regression: A Brief Overview 


Standard regression-based approaches to studies of inequality are largely 
limited to between-group differences. Group here means each category of a 
covariate. For example, "gender" has two groups: men and women. Between- 
group inequality in this case is the inequality between men and women. 
Within-group inequality in this case is the remaining inequality within the 
population of men or within the population of women. It often is the case 
that between-group inequality is far exceeded by within-group or residual 
inequality. Residual inequality often is considered as due to measurement 
error or the influence of unobserved or hidden heterogeneity. A recently 
developed class of statistical models, termed variance function regression 
(VFR) by Western and Bloome (2009) and, more generally, heteroscedastic 
regression (HR) in statistics (Smyth 1989) can be used to address this limita- 
tion by simultaneously modeling both the mean and the variance of an out- 
come variable as functions of covariates and hence takes into account both 
between-group and within-group differences. This class of models explicitly 
targets the residual variance in regression models for analysis. 

Conventional linear regression models assume that the residual error 
terms of the models are independently and identically distributed with con- 
stant or homoscedastic variance, and especially important for small samples 
for which asymptotic statistical properties of estimators do not apply, that 
the errors have normal probability distributions (see, e.g., Fox 2008: 187-219). 
Violations of these assumptions affect estimators of the standard errors of 
regression coefficients and reduce the statistical efficiency of conventional 
least squares estimators. A variety of statistical methods has been devel- 
oped for diagnosing and correcting nonconstant error variances (Fox 2008: 
272-277). The key feature of VFR/HR is that it treats violations of homosce- 
dasticity as more than a data problem that needs to be corrected to obtain 
well-behaved estimators. It rather approaches these violations as being of 
potential substantive importance and builds regression models to account 
for them. In applications to the study of inequality, such as Western, Bloome, 
and Percheski's (2008) study of trends in family income inequality in the 
United States over the years 1975 to 2005, the residual variance can be inter- 
preted as measuring within-group risk or insecurity. 
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This and the HAPC models and the substantive questions they address are 
related. One often needs to understand sources of social inequality attribut- 
able to age, time period, and birth cohort in that the APC variations repre- 
sent the complex and temporal patterns of inequality. In his seminal article 
(1965), Ryder argued that cohort membership is a structural category just as 
socioeconomic status (SES) that has explanatory power. To the extent that 
SES inequalities are defined by both between- and within-group differences 
(Western, Bloome and Percheski 2008), social inequalities by age, period, 
and cohort should also be assessed in terms of between-group differences 
and within-group dispersions. Differences in either term can bring about 
subsequent social demographic change at the population level. Therefore, 
an integrated model is useful for decomposing these two differences across 
age, period, and cohort. The VFR/HR models have been used to examine 
inequalities in both terms but have not distinguished the age, period, and 
cohort sources of temporal variation. The HAPC regression models have 
been used to examine temporal differences in conditional means but not 
within-group variances. We now illustrate the utility of the intersection of 
these two modeling frameworks with an application to the analysis of health 
disparities in the United States over the years 1984 to 2007. We discuss useful 
findings from this analysis and make general observations on the utility and 
potential for advances in empirical analyses of inequality of this synthesis of 
analytical models. 


8.5.2 Research Topic: Changing Health Disparities 


We use the term health disparities to refer to either between-group or within- 
group differences in health and distinguish the two aspects of inequality 
in specific circumstances. In the context of APC analysis, groups are defined 
by the age, time period, and cohort categories. Between-group health dispari- 
ties refer to the variations in the conditional mean (conditional on a set of 
individual-level sociodemographic variables) of health across age, period 
and cohort. Within-group health disparities refer to the conditional variance or 
dispersion of health within each category of age, period, or cohort. Changes 
in within-group health disparities refer to the variations in the conditional vari- 
ance or dispersion of health across age, period, or cohort. 

In addition to a large body of demographic and epidemiologic research 
as well as that included in this book on age variation and temporal trends 
in health and mortality that has addressed between-group health dispari- 
ties, there are three standard approaches to the study of changes in within- 
group health disparities: (1) across the life course, (2) across cohorts, and (3) across 
time periods. The empirical studies illustrated in Chapters 7 and 8 in this 
book gave samples of all three. Within each approach, there is evidence 
for significant change in health disparities. For example, the gaps in lung, 
colorectal, and prostate cancer incidence and mortality by sex and race have 
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widened with age (Figures 8.6, 8.8, 8.10). Gaps in self-rated health, physical 
functioning, well-being, disease incidence, and mortality by education lev- 
els have widened over the life course (e.g., Ross and Wu 1996; Lauderdale 
2001). The gaps in lung, breast, and prostate cancer mortality by sex and race 
have widened and declined across cohorts (Figures 8.6, 89, and 8.10). While 
the intracohort gaps by sex and race have been constant across birth cohorts 
(Yang and Lee 2009), the gap in self-rated health by education levels has wid- 
ened across birth cohorts (Chen et al. 2010; Lynch 2003). There was also evi- 
dence of increasing race disparities in cancer incidence and mortality over 
time in our previous analysis and socioeconomic inequality in health, dis- 
ability, and life expectancy in the United States in the past several decades 
(e.g., Hummer, Rogers, and Eberstein 1998; Meara, Richards, and Cutler 2008; 
Liu and Hummer 2008). Increasing sex, race, and SES inequalities across the 
life course, birth cohorts, and time periods conceivably contribute to increas- 
ing overall inequality or dispersion across these dimensions. This, however, 
cannot be examined using either the previous HAPC or the VFR analysis 
and merits a formal test using properly constructed analytic models. 
Wenow presenta method that facilitates the disentanglement of age, period, 
and cohort variations in health disparities defined by differences in both 
conditional mean levels and conditional dispersions of health Ge, between- 
group and within-group health disparities). To do so, we intersect the HAPC 
model with the VFR/HR (VFR thereafter) model. The HAPC model enables 
us to disentangle age, period, and cohort effects. The VFR model enabled us 
to separate within-group from between-group health disparities. We inter- 
sect these two statistical models and term the result an HAPC-VFR model. 


8.5.3 Intersecting the HAPC and VFR Models 


VFR/HR (see Zheng, Yang, and Land 2011 for a review of various types of 
such models) has two parts, including a regression for an outcome variable Y; 
and a regression for logarithm of the residual variances log(o?) (Western and 
Bloome 2009): 


Y; = Bo + BiX1i + Ba Kat + BeXpi + ei (8.10) 
log(6;) = Ao + ÀZ; + ÀZ; +...+ÀRZRi (8.11) 


where observations on individual sample members are indexed by i, X,, X, 
..., Xp is a set of P explanatory variables for Y, Z,, Z;, ..., Za is a set of R 
explanatory variables (possibly equal to X,, X» ..., Xp) for the logarithm of 
the residual variance log(o?), with residual random error term e; for Y;. The 
quantity o? is the square of the corresponding residuals 6? from the first 
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regression. From a substantive viewpoint, the first regression describes how 
covariates affect the Y; response variable and account for the deviations of the 
within-group sample means from the average or grand mean Y (which can 
be termed the between-group inequality), while the second regression explains 
how covariates affect the within-group variability of the response variable 
around the group means (which can be termed the within-group inequality). 

We integrate the VFR model with the HAPC model by using the HAPC 
model to estimate the two equations in the VFR model, treating cohort and 
period as random effects in the context of a repeated cross-sectional survey 
research design across a broad range of ages—so that the question of the rel- 
ative contributions of the age, time period, and birth cohort temporal dimen- 
sions are relevant. To do this, we engage in a two-step estimation algorithm. 

Step 1: Estimate the f) Regression Coefficient Vectors for Between- 
Group Inequality across Age, Period, and Cohort 

We use the REML estimator of the CCREM to estimate Equation (7.2) of 
the VFR model. The algebra for this algorithm is represented in Section 72 
on the basic HAPC model specification. This step produces a set of estimates 
of fixed effects coefficients (for the individual-level explanatory covariates), 
random effects coefficients (for cohorts and periods), and a random variance 
components matrix that evaluates the contributions of these individual-level 
and period and cohort contextual variables to the explanation of variance in 
the conditional expected value or conditional mean of the outcome variable. 
In the context of this analysis on health, this step estimates the variations in 
the conditional mean of self-rated health across period, cohort, age, and other 
individual-level covariates. 

Step 2: Estimate the A Regression Coefficient Vectors for Within-Group 
Inequality across Age, Period, and Cohort 

We next calculate the residuals (Cig = = Yik -X'a) from the Step 1 regres- 
sion, for each sample respondent i, and compute the squared residuals e a or 
denoted as o: œ We then apply the residual pseudolikelihood (RSPL)’ estima- 
tor of the CCREM to estimate Equation (8.11) of the VFR model. For normal 
distributed errors, the squared residuals will have a gamma distribution, and 
Equation (8.11) then is estimated in generalized linear mixed model form— 


* We used the SAS PROC MIXED and PROC GLIMMIX procedures to estimate the first- and 
second-step regressions, respectively. The default estimator of GLIMMIX is RSPL, which 
maximizes the residual log pseudolikelihood and provides unbiased predictors of the ran- 
dom effects. The pseudomaximum likelihood estimator uses a consistent and asymptotically 
normal estimator rather than a maximum likelihood estimator for the variance parameters. 
In models for a normally distributed outcome variable with an identity link, RSPL is equiva- 
lent to REML (Littell et al. 2006), but RSPL is consistent and asymptotically normally distrib- 
uted for nonnormal data as well. 
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as a gamma regression of o on the X;, using a log link function (Western 
and Bloome 2009: 300; see also Nelder and Lee 1991)" 
The algebra for this algorithm can be stated as follows: 


Level 1 or ^within-cell" model: 


log(05) = Aan + MX tik F NX vik SE. F À pX pij, (8.12) 


Level 2 or “between-cell” model: 


Aur — Tto +Moj + Mor, Oo; ~ NU, Yu), Qor ~ N(0, Wo) (8.13) 


Combined or mixed effects model: 


log(o%x) = To + Au Kin F Än Kam Fef À pX pij, F Oo; sk Dok (8.14) 


where Au is the intercept or “cell mean,” that is, the mean log(o?) of indi- 
viduals who belong to birth cohort j and were surveyed in year k; n, is the 
expected mean of log(o?) at the zero values of all level 1 variables averaged 
over all periods and cohorts; ou and @, are the residual random effects of 
cohort j and period k, respectively, assumed normally distributed with mean 
0 and variance y, and y,. In addition, Au = To + (3; is the cohort log(o?) score 
averaged over all periods with all individual-level covariates at grand mean 
level; and Au = To + Pox is the period log(o?) score averaged over all cohorts 
with all individual-level covariates at grand mean level. 

This step produces a set of estimated fixed effects coefficients (for the 
individual-level explanatory covariates) random effects coefficients (for 
cohorts and periods), and a random variance components matrix that evalu- 
ates the contributions of these variables to the explanation of variance in 
the logarithm of the residual variances log(o2) for each sample respondent i. 
In the context of this study, this step estimates the variations in the vari- 
ance or dispersion of self-rated health across period, cohort, age, and other 
individual-level covariates. It merits emphasizing here that the predicted o? 
for age, period, or cohort represents a general form of dispersion of health 
across age, period, or cohort, whereas previous empirical research in health 
disparities focused on specific inequality by one or two dimensions, such as 


` The outcome variable in the analyses described in the following material, self-rated health, 
is not normally distributed; it is skewed to the left. The residuals calculated from the Step 1 
regression have a symmetric distribution that has short tails compared to a normal distribu- 
tion. Because of the very large sample size, estimates of the coefficients of the Step 2 regres- 
sion still have good statistical properties. This is due to the fact that, for independently and 
identically distributed data, the RSPL method produces estimates of the fixed and random 
effects of a mixed model that are consistent and asymptotically normally distributed; even 
the identically distributed assumption can be relaxed (Demidenko 2004: 647). 
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sex, race, or SES. An increase or decrease in any one of the specific dimen- 
sions will contribute to the increase or decrease in the general dispersion, 
which however has not been studied in the current literature. Therefore, 
instead of examining the changing effects of each specific dimension on 
health, this analysis investigated how the general dispersion of self-rated 
health may change across age, period, and cohort. 

Even though each of these two steps produces REML or RSPL estimators 
from the CCREM, it must be iterated to obtain ML estimators for the VFR 
model (Aitkin 1987). As Western and Bloome (2009: 301) indicated, the fitted 
values (65,) from an application of the two steps should be saved and used 
in a weighted regression of Vo, on Ai, X», ..., Arno With weights (1 /6,,). 
Estimates of the residuals from Step 1 then are updated, Step 2 is computed, 
and so forth until convergence. Western and Bloome (2009: 301) noted that 
the ML estimator may perform poorly in small samples, in which case a 
REML or Bayes estimator can be used. In the empirical application described 
in the following material, however, the sample sizes were very large, so the 
adjustments in the REML made for the loss of degrees of freedom resulting 
from estimation of the regression parameters would be very small, if not 
trivial. Therefore, for purposes of the empirical application of the HAPC-VFR 
model in this chapter, the ML estimator was applied. Sample codes for sub- 
sequent analyses implementing the HAPC-VFR model are available in the 
online companion. 


8.5.4 Results: Variations in Health and Health Disparities 
by Age, Period, and Cohort, 1984-2007 


Table 8.8 reports estimates of parameters, standard errors, and model fit 
statistics for the HAPC-VFR models of self-rated health in National Health 
Interview Survey (NHIS) data from 1984 to 2007. The “B” column presents 
the results for the first-stage regression of the HAPC-VFR model (which esti- 
mated variations in mean health across groups), and the "A" column presents 
the results for the second-stage regression of the HAPC-VFR model (which 
estimated variations in dispersion of health across groups). 

These results are for the sample as a whole. There was substantial evidence 
of major gender differences in temporal trends in health disparities in prior 
research. Therefore, we also investigated variations in gender-specific self- 
rated health disparities across age, time period, and cohort. In other words, 
we investigated how health disparities may change across age, time period, 
and cohort within each gender-specific sample. We present the results from 
this last stratified analysis in subsequent figures only. 

As shown in the "D" column, consistent with findings from previous stud- 
ies, being male, white, married, and more educated and having a job and more 
income were associated with better self-rated health. The effects of age were 
curvilinear (quadratic) in that the self-rated health declined with age and then 
began to increase in late life. The 1995 NHIS sample redesign significantly 
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TABLE 8.8 
Estimated HAPC-VFR Models of Self-Rated Health, NHIS, 1984-2007 
p A 
Fixed Effects Coefficient se Coefficient se 
Intercept 3.28 *** 0.01 0.4 *** 0.02 
AGE —0.14*** 0.00 0.07*** 0.01 
AGE2 0.03 *** 0.00 -0.04*** 0.00 
SEX (12male) 0.03 *** 0.00 -0.01* 0.01 
RACE (1=white) 0.17*** 0.00 -0.08 *** 0.00 
MARRIED 0.02 *** 0.00 —0.02 *** 0.00 
EDUCATION 0.06 *** 0.00 -0.03 *** 0.00 
EMPLOYED 0.39 *** 0.00 —0.34*** 0.00 
INCOME/10000 0.07*** 0.00 —0.04*** 0.00 
REDESIGN —0.06 *** 0.01 -0.07 ** 0.01 
REDESIGN*MALE —0.02 *** 0.01 0.02 * 0.01 
Random Effects Coefficient se Coefficient se 
Cohort 
1899 0.03* 0.01 0.20 *** 0.03 
1905 -0.01 0.01 0.12*** 0.03 
1910 -0.01 0.01 0.03 0.03 
1915 0.00 0.01 0.00 0.03 
1920 —0.01 0.01 —0.02 0.02 
1925 -0.02 ** 0.01 -0.03 0.02 
1930 -0.01 0.01 -0.02 0.02 
1935 0.00 0.01 -0.03 0.02 
1940 0.00 0.01 -0.04 0.02 
1945 —0.01 0.01 —-0.07** 0.02 
1950 0.01 0.01 -0.09 *** 0.02 
1955 0.02 *** 0.01 —0.10 *** 0.02 
1960 0.02*** 0.01 —0.10 *** 0.02 
1965 0.00 0.01 -0.07** 0.02 
1970 -0.01 0.01 -0.04 0.03 
1975 -0.03 *** 0.01 0.04 0.03 
1980 -0.01 0.01 0.08 ** 0.03 
1985 0.01 0.01 0.13*** 0.03 
Period 
1984 —0.01 0.01 0.02** 0.01 
1985 -0.01 0.01 0.01 0.01 
1986 0.01 0.01 —0.01 0.01 
1987 -0.01 0.01 -0.01 0.01 
1988 -0.01 0.01 0.01 0.01 
1989 0.01 0.01 —0.01 0.01 


1990 0.02** 0.01 0.00 0.01 
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TABLE 8.8 (continued) 
Estimated HAPC-VFR Models of Self-Rated Health, NHIS, 1984-2007 


p A 
Random Effects Coefficient se Coefficient se 
1991 0.02* 0.01 —0.01 0.01 
1992 0.00 0.01 -0.01 0.01 
1993 -0.01 0.01 0.01 0.01 
1994 0.00 0.01 0.00 0.01 
1995 -0.01 0.01 0.00 0.01 
1996 0.00 0.01 0.00 0.01 
1997 0.03 *** 0.01 0.00 0.01 
1998 0.03 ** 0.01 -0.01 0.01 
1999 0.03 ** 0.01 0.00 0.01 
2000 0.02 0.01 0.00 0.01 
2001 0.01 0.01 0.00 0.01 
2002 0.00 0.01 0.00 0.01 
2003 0.00 0.01 0.00 0.01 
2004 -0.02** 0.01 —0.01 0.01 
2005 —0.02 ** 0.01 0.00 0.01 
2006 —0.01 0.01 0.00 0.01 
2007 -0.05 *** 0.01 0.01 0.01 
Variance Components Variance se Variance se 
COHORT 0.00* 0.00 0.01** 0.003 
PERIOD 0.00 ** 0.00 0.00 0.000 
Model Fit 
BIC 1941250.0 
-2 Res Log Pseudo-Likelihood 2351732.0 


Source: Adapted from Zheng et al. (2011: Table 2). 
Note: * p «0.05; ** p < 0.01; *** p < 0.001. 


decreased the expected value of self-rated health for about 0.06 points for 
women and 0.08 points for men. The estimates of residual variance com- 
ponents at level 2 indicated significant period and cohort effects net of the 
effects of individual-level covariates, while the period effect was larger than 
the cohort effect as reported in the “Variance Components” section. 

Figure 8.12 presents predicted age curves and estimates of cohort and 
period effects on mean self-rated health from the HAPC part of the inte- 
grated model estimated for the gender-specific samples. The top panel shows 
that both men’s and women’s self-rated health declined with increasing age, 
but the trends reversed after around age 69 for men and age 72 for women. 
It can be seen that men reported better self-rated health than women at all 
ages. The gender gap in health was largest in the early adult years, narrowed 
until around age 61, and widened afterward. 
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FIGURE 8.12 
Variations in conditional expected values of gender-specific self-rated health across age, 
cohort, and period, with 95% confidence intervals. 


The estimated cohort effects were relatively flat across cohorts for men, 
except that baby boomers born between 1950 and 1959 significantly had 
better self-rated health than other cohorts. By comparison, the conditional 
expected values of self-rated health changed dramatically across cohorts for 
women. They continued declining from the 1899-1904 cohort to the early 
baby boomers born in 1945-1954 and then rose for the middle and late baby 
boomers and afterward, which resulted in a widened and then narrowed 
self-rated health gap between men and women. In addition, before 1998, the 
period-to-period changes in self-rated health exhibited a very slight increase 
accompanied by cycles up and down, with a significant decline after 1998. 
The estimates in Table 8.8 as well as the gender-stratified analysis suggest 
period effects contributed slightly more than cohort effects to the changes in 
self-rated health from 1984 to 2007 for both men and women. 

As a key output of the VFR part of the integrated model, the "A" column 
in Table 8.8 shows how individual-level covariates affected within-group 
health disparities. The estimated within-group health disparities for males, 
whites, married persons, the more highly educated, employed individu- 
als, and those more income were smaller than those of their counterparts, 
that is, females, blacks, unmarried persons, the less educated, unemployed 
individuals, and those with less income. In addition, the integrated HAPC- 
VFR model yielded estimates of expected or predicted variations in health 
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FIGURE 8.13 
Variations in predicted dispersion of gender-specific self-rated health across age, cohort, and 
period, with 95% confidence intervals. 


disparities across age, period, and cohort (or within-age, within-period, and 
within-cohort health disparities). The estimates of residual variance compo- 
nents at level 2 indicate significant cohort and nonsignificant period effects 
net of the effects of individual-level covariates as reported in the “Variance 
Components” section. Graphs of estimated age, period, and cohort effects 
from the gender-stratified analysis are shown in Figure 8.13. The estimated 
within-age health disparities had a bell shape that peaked around age 56 
for both men and women. After controlling for demographic and socioeco- 
nomic statuses, estimated health disparities in the young adult ages were 
relatively small, indicating that most everyone was relatively healthy. But, 
health disparities increased with age and later declined with age. Compared 
to men, the within-age health distributions for women were slightly more 
spread out. The dispersions were larger than those of men at all ages. They 
increased at a slower rate before age 56 and decreased at a slower rate after 
age 56. 

Figure 8.13 also shows that for women, within-cohort health disparities 
decreased from the 1899-1904 cohort to the 1930-1934 cohort, increased in 
cohorts born in the late stage of the Great Depression and World War II, fol- 
lowed by decreases in baby boomer cohorts and increases in recent cohorts. 
For men, within-cohort variances fluctuated more across cohorts. They 
decreased from the 1899-1904 cohort to the 1915-1919 cohort and then were 
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relatively flat until they substantially declined again in cohorts born in the 
Great Depression, World War II, and baby boomer cohorts born between 
1945 and 1959 followed by substantial increases afterward, especially in 
more recent cohorts. After controlling for individual-level covariates and age 
and cohort effects, the estimates of within-period health disparities graphed 
in Figure 8.13 are very flat between 1984 and 2007 for both men and women. 
The lack of significant random effects coefficients for period and the statis- 
tically insignificant variance components of the period effects in Table 8.8 
suggest the variance in self-rated health did not significantly vary across 
periods. These gender-specific analyses further support the inference that 
cohort effects contributed more than period effects to the changes in health 
disparities. The striking gender difference in the changes in within-cohort 
health disparities merits further research. 


8.5.5 Summary 


The core idea of the HAPC-VFR model is to estimate mixed (fixed and ran- 
dom) effects regression specifications of the two equations in the VFR model 
as a function of age, period, and cohort and other individual-level covariates, 
treating cohort and period as random effects. Thus, the first mixed effects 
regression described how age, period, and cohort affected mean of self-rated 
health net of a set of individual-level covariates (gender, race, marital status, 
work status, education, and income); the second mixed effects regression 
explained how within-group health disparities changed across age, period, 
and cohort. The differences in group-specific means examined in the first 
step were the topic of study in prior studies of health status and the basic 
HAPC model. It was the detection of these temporal changes in within-group 
variations and their decomposition into age, period, and cohort components 
that were made possible by the integrated HAPC-VFR model. 

By contrast to what we found for the conditional mean of self-rated health, 
cohort effects appeared to contribute much more than period effects to 
the changes in the variance of self-rated health over the past two decades. 
Within-cohort health disparities generally decreased from the 1899-1904 
cohort to the baby boomer cohorts and have substantially increased for post- 
baby boomer cohorts. As post-baby boomer cohorts (especially cohorts born 
after 1980) had much larger within-cohort health disparities than preceding 
cohorts, and within-age health disparities increased with age until around 
age 55, it can be expected that health disparities in the general population 
will further increase in the next one or even two decades as these cohorts age 
and replace preceding cohorts. The enlarged health disparities for the post- 
baby boomer cohorts are alarming, and smaller within-cohort heterogeneity 
in men than women is intriguing. These findings should stimulate future 
studies on the underlying mechanisms and explanations that are readily 
testable using the HAPC-VFR model. 
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The nature of the self-rated health outcome variable—in the form of five 
ordered response categories (poor, fair, good, very good, and excellent)— 
complicates the specification and estimation of the combined HAPC-VFR 
model. This model was described in a linear mixed effects regression format. 
For several reasons, we applied this specification to the NHIS data by scaling 
the self-rated health outcome variable as a five-point scale with responses 
numbered from 1 to 5. First, this choice facilitated comparisons with prior 
research using similar self-rated health data. Second, the equal-intervals 
assumption of the five-point scale was, in fact, a good specification for the 
self-rated health responses in the NHIS data. Evidence of this was obtained 
from an ordered logit regression analysis of this outcome variable. But, it is 
still possible that this model was not fully sensitive to either the ordered cat- 
egorical nature of the survey responses to the self-rated question or the non- 
normal frequency distribution of the five-point scaling of these responses. 
Accordingly, Zheng, Yang, and Land (2011) described additional analyses to 
assess the robustness of the empirical findings reported from application of 
this model, including alternative coding of the response variable and repli- 
cation of empirical findings using a different dataset. They also described 
some methodological extensions to adapt it to the ordered nature of the self- 
reported health outcome variable. 

By using the HAPC-VFR model, we have been able to give a more complete 
picture of the evolution of changes in self-rated health and health disparities 
in the United States from 1984 to 2007. Two significant contributions to the 
health disparities literature are notable. First, health disparities across age, 
time period, and birth cohort were intertwined but have not been system- 
atically disentangled in the existent studies due to the lack of an integrated 
model. By using the HAPC-VFR model, this study demonstrated that changes 
in self-rated health disparities in the last two decades have been much more 
of a cohort than a period story. Therefore, further research in this area should 
pay more attention to the "cohort" perspective. Second, prior literature has 
focused on the changes in health inequality defined by specific aspects of 
social stratification system such as gender, race, and SES across age, period, 
or cohort, without an overall picture of the changes in general dispersion of 
health across these three dimensions. The HAPC-VFR model offers an analytic 
tool to capture the general dispersion of health across these three dimensions, 
which provides the basis for further research to examine the contribution to 
the general dispersion by each specific aspect of social stratification system. 

Inequality or disparity in statuses occurs in many domains of social life 
(e.g., income, wealth, education, and heath care access, to name but a few). 
The HAPC-VFR model provides a powerful framework and lens through 
which to identify and study the evolution of variations and social inequali- 
ties in these outcomes across the age, period, and cohort temporal dimen- 
sions. Accordingly, this model should be broadly applicable to the study of 
social inequality in many different substantive contexts. 
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9 


Mixed Effects Models: Hierarchical 
APC-Growth Curve Analysis 
of Prospective Cohort Data 


9.1 Introduction 


The repeated cross-sectional data designs for which generalized linear 
mixed models (GLMMs) in the form of hierarchical age-period-cohort 
(HAPC) models were specified and estimated in Chapters 7 and 8 rely on 
information from synthetic cohorts that contain different cohort members at 
each point in time. That is, the cohorts defined in these designs are synthetic 
in the sense that they contain different individuals as opposed to the same 
group of individuals at each calendar period. Inferences drawn from such 
designs about cohort and age effects therefore rest on the assumptions that 
synthetic cohorts mimic true cohorts and changes over time across synthetic 
cohort members mimic the age trajectories of change within true cohorts. If 
the composition of cohorts does not change significantly over time due to 
migration or other factors and sample sizes are large, these assumptions are 
generally met, and the demographic tool of synthetic cohorts is most useful 
in the absence of information from true cohorts. 

In the life course and human development research and life sciences that 
focus on age-related changes, however, researchers often need to rely on 
longitudinal data obtained from the same persons followed over time to 
track continuity and change within individual lives. The accelerated longitu- 
dinal panel design—wherein multiple birth cohorts are followed over multiple 
points in time—is an important advance in aging and cohort research. The 
advantage of this design is that it not only provides cross-time linkages within 
individuals and hence information pertaining to true birth cohorts but also 
allows a more rapid accumulation of information on age and cohort effects 
than a single cohort follow-up study. In this chapter, we develop a similar 
GLMM approach to the analysis of prospective panel data using accelerated 
longitudinal cohort designs. We include empirical applications that continue 
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to reveal patterns and mechanisms underlying social stratification of aging 
and health. 

Recent social and epidemiologic research on health disparities has been 
perplexed by inconsistent findings on whether these disparities grow or 
diminish over the life course. One key issue that has contributed to such 
inconsistency is the confounding of age and cohort changes. The essen- 
tial assumption of previous studies of the relationship between aging and 
health that omit cohort analysis is that differences in age patterns of health 
observed from cross-sectional data represent true intraindividual develop- 
mental trajectories of change over time that are equal across various birth 
cohorts. This assumption may not be tenable in light of the social changes 
and vastly different historical and life experiences of birth cohorts during 
the twentieth century that bear important consequences for cohort differ- 
ences in mental and physical health. To understand the mechanisms gener- 
ating social inequalities in health over the life course, one must take the role 
of cohort change into account and systematically examine both intercohort 
and intracohort variations in health trajectories. 

We show how to use GLMMs to disentangle the effects of aging and birth 
cohort in longitudinal panel study designs. We examine three questions 
essentialto the understanding of independent age and cohort effects. First, are 
there intercohort variations in aging experiences? That is, are the age-related 
changes specific to birth cohorts? If the answer is yes, then the conventional 
approach to testing age-related changes by omitting the cohort variations 
confounds age and cohort effects. Second, are there intracohort variations in 
the age trajectories? That is, within each birth cohort, do individuals with 
different characteristics such as socioeconomic status (SES) show different 
age-related changes? While the first question treats cohorts as homogeneous 
groups, this second question further examines within-cohort heterogeneity 
and is especially useful for testing aging-related hypotheses. Third, are there 
intercohort variations in intracohort differences in aging? This question 
jointly examines heterogeneities between and within cohorts and informs 
us of how individual aging experiences are shaped or conditioned by social, 
historical, and epidemiologic contexts. Drawing on previous studies of two 
longitudinal datasets, the Americans' Changing Lives (ACL) (Yang and Lee 
2009) and Health and Retirement Survey (HRS) (Yang and Lee 2010), we next 
address these questions analytically to fully capture the processes generat- 
ing social inequalities over the life course.’ 


* Parts of Sections 9.2 and 9.4 were adapted from Yang and Lee (2009); parts of Section 9.3 were 
adapted and updated from Yang, Y., and L. C. Lee. 2010. Journal of Gerontology: Social Sciences 
65B:246-255. 
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9.2 Intercohort Variations in Age Trajectories 
9.2.1 Hypothesis 


We have learned from previous studies of others and our own included in 
the previous chapters that social demographic and historical changes have 
had profound and enduring effects on cohorts' health and mortality risk. 
They support the contention that throughout the twentieth century, indi- 
viduals' health capital—physiological robustness and capacity of vital organ 
systems—has improved with the year of birth, with more recent cohorts 
faring substantially better in their initial endowments at birth and having 
lower depreciation rates in that stock of health capital (Fogel 2004). Improved 
physiological capacities in later cohorts also bode well for the effectiveness of 
medical treatments. In fact, recent demographic research showed that better 
nutrition and reduced inflammatory-infection in utero and during infancy 
have reduced the risks for major chronic diseases in adulthood (Barker 1998) 
and led to less-severe disabilities for successive birth cohorts (Crimmins, 
Reynolds, and Saito 1999). Improvements in physical functioning in more 
recent cohorts are also likely given evidence of continuous cohort improve- 
ments in education, general health, quality of life, and smoking cessation 
(Haug and Folmar 1986; Hughes and O'Rand 2004; Pampel 2005). In contrast, 
early life experiences such as childhood poverty and traumas associated 
with the Great Depression and world wars of the twentieth century have 
been shown to negatively affect the mental and physical health of earlier 
cohorts (Elder 1999; O'Rand and Hamil-Luker 2005). 

On the other hand, it is possible that the health improvements across cohorts 
are constrained to some extent by economic, cultural, and lifestyle changes. 
For instance, there have been substantial changes in family structures, such 
as decreases in marriages and increases in divorces in more recent cohorts 
(Popenoe 1993; Waite 1995). The larger sizes of the baby boom cohorts also are 
associated with increased life stress and decreased happiness in adulthood 
(Easterlin 1987; Yang 2008). In addition, there have been evident increases in 
lifestyles harmful to cardiovascular health and increases in obesity in later- 
born cohorts (Cabrera et al. 2003; Flegal et al. 2002; Reither, Hauser, and Yang 
2009). All these risk factors may have dampened the positive cohort effects 
on health. Studies of depressive symptoms showed evidence of more depres- 
sion in war babies (1935-1945) than in earlier cohorts (Kasen et al. 2003; Yang 
2007). Earlier cohorts (1900-1905 and pre-1900 cohorts) have also been found 
to have better perceived or self-rated health in late life than their successors 
(1906-1917 cohort) (Idler 1993). Because there are both positive and delete- 
rious forces affecting the life courses of more recent cohorts, an empirical 
question is whether the cohort trend of improving health can be generalized 
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to the entire adult life course and to cohorts born after World War II such as 
the baby boomers. 

In light of the many social and historical forces that could have produced 
significant cohort differences in aging experiences, we test the hypothesis 
of intercohort change—that there are substantial intercohort variations in 
health trajectories over the life course, with more recent cohorts having bet- 
ter health on average (higher mean levels of health) and lower growth rates 
of health problems with age. 


9.2.2 Model Specification 


We have shown in a previous chapter how GLMMs can be flexibly applied to 
model a variety of cohort-related phenomena using repeated cross-sectional 
data. GLMMs continue to be a useful tool for modeling longitudinal data. 
Because repeated observations over time can be viewed as level 1 units 
nested within individuals who are level 2 units, one can specify two-level 
hierarchical models in the form of growth curve models to assess individual 
change. Growth curve models have been widely used in longitudinal stud- 
ies of individual change (Raudenbush and Bryk 2002). However, we employ 
such models differently from the way they often are used when a single 
cohort is followed over time, as, for example, in test scores of students in 
schools. Specifically, it will become clear in the models we specify in this 
chapter that the growth curve models applied to multiple cohort panel data 
are special cases of the HAPC models. Although they are distinct from the 
HAPC-CCREMs (cross-classified random effects models), they are hierar- 
chical models that explicitly incorporate cohort effects and implicitly incor- 
porate period effects (in the form of age-by-cohort interaction). Therefore, 
we term them hierarchical APC-growth curve models (HAPC-GCMs). In 
the context of the accelerated longitudinal design depicted by Figure 3.3, we 
then can conduct HAPC-GCM analyses of intracohort age changes (across 
columns) and intercohort differences (across rows) simultaneously. 

The HAPC-GCM specification for the ACL data is shown in Equations 
(9.1)- (9.4). 


Level 1 repeated observation model: 


Yi = Boi + BuiAges + B. Ages ten (9.1) 


where Y, is one of the health response variables (Center for Epidemiologic 
Studies Depression Scale [CES-D] score, functional disability, or self-rated 
health) for respondent í at time t, for i = 1, ..., n and t = 1, ..., T; T; is the 
number of measurements and ranges from 1 to 4; Age, is the age of respon- 
dent i at time t. We estimated both simple and quadratic age effects models 
and found that the latter fit the data substantially better. Consequently, the 
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present model includes both linear and quadratic terms of age. We centered 
the age variable around the median age of the 10-year cohort to which the 
person belonged obtained from Table 3.10 because cohort-median center- 
ing protects the estimate from bias associated with systematic variation in 
mean age across cohorts and is equivalent to person-mean centering, which 
facilitates interpretation (Miyazaki and Raudenbush 2000). The intercept Bo; 
is the expected health response score of person i at the median cohort age; 
Bı; is the expected linear rate of increase or growth rate per year of age for 
person i; B,; is the expected quadratic rate of increase; and e, is the random 
within-person error for person i at t and is assumed normally distributed. 
The individual growth parameters D, Bj; and B;; depend on person-level 
characteristics such as cohort membership. The level 2 model thus specifies 
a distinct average trajectory for each cohort and incorporates other time- 
invariant covariates associated with each individual: 


Level 2 model: 


For the intercept: 


Bo; = Yoo + YoCohort; ak KR + Woi (9.2) 
q 


For the linear growth rate: 


m = Yio T Yu Cohort, + Wj (9.3) 


For the quadratic growth rate: 


B»; = Yoo T YaiCohort; + Wj (9.4) 


In the model for the intercept [Equation (9.2)], Yo is the expected health 
outcome at age 88 (median) for cohort born before 1905 at zero values of other 
covariates; y, is the main effect of cohort that indicates mean difference in 
health between cohorts or intercohort variation in the mean; Yo, is the coefficient 
for person-level covariates Z,; (as discussed in the following). In models (9.3) 
and (9.4), the linear and quadratic growth rates per year of age further vary 
by cohort: Yı and ta are the expected linear and quadratic rates of change 
in health response in the 1905 cohort, respectively; yı, and y; are the age-by- 
cohort interaction effects and indicate mean differences in rates of change 
between cohorts or intercohort variation in the age effects. Finally, wj; W, and 
W,, are the residual random effects, after controlling for cohort differences, of 
person i on health and the rates of increase by age in health, respectively, and 
are assumed to have a multivariate normal distribution. The cohort effects 
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can be nonlinear and represented by cohort dummy indicators or polyno- 
mial terms. The choice of operationalization in the final model depends on 
significance tests of coefficients and comparative model fit. 

The hierarchical/multilevel linear model (HLM)-growth curve meth- 
odology has the advantage of allowing data that are unbalanced in time 
(Raudenbush and Bryk 2002). That is, it incorporates all individuals with data 
for the estimation of trajectories, regardless of the number of waves he or she 
contributes to the person-year dataset. Compared to alternative modeling 
techniques, this substantially reduces the number of cases lost to follow-up 
due to mortality or nonresponse. However, the loss-to-follow-up subsample 
needs to be distinguished from, and compared with, those with complete 
data for all waves. If mortality and nonresponse are significantly correlated 
with worse health and other key covariates of health such as age, parameter 
estimates of health trajectories may be biased if they are not controlled. To 
fully assess the potential influence of selection due to death or nonresponse, 
we control for the effects of attrition by including dummy variables indicat- 
ing the deceased and nonrespondents in the level 2 models. That is, Z,; = 
(Died, Nonresponse), for q = 1 and 2 in Equation (9.2). 

Although the application of two-level hierarchical regression models to 
the analysis of age-dependence in standard single-cohort longitudinal panel 
data is relatively straightforward, growth curve analysis of multicohort, 
multiwave data is complicated by two issues (Yang 2010). 

First, because the observable age trajectories of different cohorts initiate 
and end at different ages, cohort comparisons are based on different seg- 
ments of cohort members' life course (see, e.g., Table 3.10). When the obser- 
vations for one cohort are not overlapping with another in age, one cannot 
compare cohorts at the same ages. And in this case, age and cohort are diffi- 
cult to distinguish as earlier cohorts are older in age. This raises the question 
of the potential confounding of the age and cohort effects. Two analytic strat- 
egies help to resolve this problem. The first is using centered age variables, as 
illustrated previously in this chapter. Age centering eases the interpretation 
of the intercept, stabilizes estimation, and prevents the bias in the estimate 
that arises from systematic variation in mean age across the cohorts, hence 
eliminating the confounding of age and cohort variables. Second, the models 
yield tests of significance of overlapping segments of the life course of adja- 
cent cohorts. As waves of data accumulate, the number of overlapping ages 
of adjacent cohorts increases, which increases statistical power. In this case, 
age and cohort will become less and less confounded, making it increasingly 
possible to compare cohort differences in age trajectories. 

The second issue concerns period effects. The models specified in the 
previous discussion here do not explicitly incorporate period effects for 
both substantive and methodological reasons. First, in contrast to synthetic 
cohort designs that usually cover several decades, accelerated longitudinal 
designs typically span much shorter time periods (e.g., a decade or so). So, the 
effects of period can be assumed to be trivial and omitted from the models, 
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especially if the theoretical focus is on aging. Second, it is challenging to 
estimate a separate period effect in the framework of the growth curve mod- 
els. The level 1 analysis reflects within-individual change by modeling the 
outcome as a function of the time indicator (Singer and Willett 2003). This 
means that one can include age or wave (period) depending on substantive 
focus, but not both because within individuals age and period are the same. 
The simultaneous estimation of age and period effects creates the model 
identification problem that requires different data designs and mixed 
model specifications—the HAPC-CCREM analytic framework described in 
Chapters 7 and 8—to resolve. Third, one does not need to estimate period 
effect per se and can focus instead on the age-by-cohort interaction effects. 
Because cohorts vary in age at any historical moment, effects associated with 
historical time, if any, tend to produce cohort differences in the age-outcome 
relationship. If period effects are operating, pooling of data from all cohorts 
(or omission of cohort effects from the models) would yield biased estimates 
of age-related changes, but controlling for cohort effects and age-by-cohort 
interactions captures these changes precisely (Yang 2007). 

Based on the combined models of (9.1)- (9.4), we obtained restricted maxi- 
mum likelihood-empirical Bayes (REML-EB) parameter estimates using 
SAS PROC MIXED (see sample codes for selected tables). Additional anal- 
yses using alternative GLMM specifications such as Poisson and negative 
binomial mixed effects models did not yield substantively different results. 
Accordingly, because of their familiarity and ease of interpretation, we report 
estimates from linear mixed models (LMMs) using a normal link (sample 
codes are provided for selected models shown in Tables 9.1-9.3). 


9.2.3 Results 


Table 9.1 presents the model effect coefficients and significance tests for 
depressive symptoms, physical disabilities, and self-assessments of health. 
The intercohort differences in age trajectories of health can be clearly seen in 
Figure 9.1, which plots the expected growth trajectories of depressive symp- 
toms, physical disability, and self-rated health by cohort (these are based 
on estimates of similar models that also adjust for other covariates in the 
analysis in Section 9.4 and hence for the reference group with mean values 
of all these covariates). Results support the hypothesis that there are signifi- 
cant cohort variations in both mean levels (B) and growth rates (B;; and B;) 
of health problems, although the directions of change differ by the health 
outcome examined. Specifically, more recent cohorts suffered from more 
depression on average, as indicated by a positive cohort effect coefficient (yy) 
and higher intercepts for more recent cohorts in Figure 9.1a. On the other 
hand, they fared better in physical functioning and self-rated general health 
on average, as indicated by significant cohort effect coefficients and cohort- 
specific intercepts in Figures 9.1b) and 9.1c, respectively. The cohort effects are 
linear for the depression and self-rated health models, with each successive 
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TABLE 9.1 
Growth Curve Model Estimates of Cohort and Aging Effects on Health 


CES-D Disability ` Self-rated Health 


Coef. Coef. Coef. 
Fixed Effects Parameters (t Ratio) (t Ratio) (t Ratio) 
For Intercept Bo; 
Intercept Yoo —0.34** 2.00 *** 3.15*** 
COHORT You 0.03 ** —0.28 *** 0.10 *** 
(3.45) (-8.59) (-9.31) 
COHORT2 Yoo 0.02 *** 
(4.86) 
For Linear Growth Rate M 
Intercept Vio 0.13 ** 0.83 *** -0.08 
(2.93) (11.24) (-1.58) 
COHORT Yu —0.06 ** —0.25 *** -0.03* 
(-5.84) (6.00) (-2.37) 
COHORT? nm 0.02 *** 
(3.82) 
For Quadratic Growth Bo; 
Rate 
Intercept Yoo 0.09 ** 0.31*** 0.09 *** 
(4.69) (6.51) (4.50) 
COHORT Yn -0.06 *** 
(-5.62) 
Control Variables 
DIED Yos 0.37** 0.44*** -0.48 *** 
(9.13) (14.79) (711.44) 
NONRESPONSE Yos 0.20 *** 
(5.87) 
Random Effects—Variance Components 
Level-1: Within-person Cu 0.38** 0.223 0.38*** 
Level-2: In intercept Woi 0.44 ** 0.30 *** 0.52 *** 
In growth rate Uy 0.16** 0.15*** 0.24*** 
Goodness-of-Fit 
BIC 28100.5 22505.4 28592.3 


Note: + p «0.10; * p < 0.05; ** p < 0.01; *** p < 0.001 (two-tailed test). 
[4 p D pP 


cohort having a 0.03-unit increase in the standardized CES-D score and a 
0.10-unit increase in self-rated health on average (p < .001). The cohort effect 
is quadratic for the disability model, with each successive cohort having a 
0.28-unit lower disability score (y) that decreased at an increasing rate of 
0.02 (yo) (p < .001). 
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FIGURE 9.1 
Predicted age trajectories of three health outcomes by cohort: (a) depressive symptoms; (b) 
physical disability; (c) self-rated health. 


The results also show that health problems increased significantly with 
age, as indicated by the intercept coefficients for the linear Del and quadratic 
growth rates Gol for all three health outcomes. However, such quadratic age 
patterns were not universal across cohorts and therefore were not entirely 
developmental in nature but affected by social context as defined by cohort 
membership. The cohort effect coefficients for the growth rate models 
indicate significant age-by-cohort interaction effects that suggest different 
growth trajectory patterns across cohorts. Figure 9.1 further illustrates age 
trajectories of depression, disability, and self-rated health that were unique 
to each cohort. In more recent cohorts, depressive symptoms and disability 
increased less steeply with age, and the smaller age increases across cohorts 
were significant for both (yr, = —06, p < .01, in CES-D model; yr, = —25, 5; = .02, 
p < .001, in disability model). At the same time, while Figure 71a shows that 
more recent cohorts had lower levels of depression at the same ages than the 
earlier cohorts, Figure 9.1b shows that successive cohorts had higher levels 
of disability than their predecessors at the same ages. This may be due to an 
earlier onset of debilitating conditions in recent cohorts, such as obesity and 
diabetes. Figure 9.1c shows that more recent cohorts also reported slightly 
faster declines in perception of good general health with age than earlier 
cohorts (yr, = -.03, p < .05). 
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The last set of results on control variables suggests that the deceased had 
significantly more depressive symptoms and disabilities and worse self- 
rated health than those who completed all surveys. Nonrespondents had 
more depression but did not differ from other survivors in disability or self- 
rated health. Adjusting for the attrition status produced estimates of age tra- 
jectories and cohort effects that would otherwise be biased. 

In sum, we found strong support for the intercohort change hypothesis that 
there exist substantial intercohort variations in aging trajectories of health. 
We also found that the directions of cohort change did not consistently 
favor more recent cohorts but differed by health outcomes and parameters 
of health trajectories. On the one hand, more recent cohorts fared increas- 
ingly better in mean levels of physical functioning and self-assessments 
of health and experienced faster declines in depression and slower incre- 
ments in disability with age. This reflects the long-term health benefits of 
physiological improvements and smoking cessation across cohorts. On the 
other hand, more recent cohorts suffered from more depressive symptoms 
on average and showed faster declines in perceived general health with age 
than earlier cohorts did. As expected, this pattern partly reflects the mental 
health impacts of unique demographic and cultural experiences of different 
twentieth-century cohorts with regard to marriage patterns, family struc- 
tures, and lifestyles affecting nutrition and body mass. The finding is also 
consistent with previous research that showed that earlier cohorts born in, 
and coming of age during, the Great Depression and world wars may have 
gained more satisfaction and positive views of themselves having survived 
economic deprivations, social instability, and related social hardships (Idler 
1993). Later cohorts, in contrast, faced more complicated problems associated 
with prosperity—such as labor market competition, human relations, and 
medicalization—and may thus have manifested more mental health prob- 
lems (Yang 2007, 2008). 


en 


9.3 Intracohort Heterogeneity in Age Trajectories 
9.3.1 Hypothesis 


Previous social and epidemiologic research has found that social status indi- 
cated by sex, race, and SES manifests strong relationships with health and the 
way health changes over the life course. Whereas the female, black, and low- 
SES disadvantages in health are a general observation, the patterns of social 
disparities in health trajectories are much less clear. Although most extant 
studies suggested that the salience of sex, race, and SES in affecting health 
varies across age, they have reported different directions for such variations. 
A key factor that has led to the inconsistent understanding about aging and 
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stratification patterns is the same one that plagues studies of health trajecto- 
ries in the overall population—the confounding of aging and cohort effects. It 
remains unknown whether discrepant findings on changes in social gaps in 
health with age are actually due to birth cohort differences because few prior 
studies have simultaneously examined and effectively distinguished aging 
and cohort effects using longitudinal research designs. We argue that taking 
cohort effects into account can largely resolve the inconsistencies in previous 
findings and can provide a better test of competing theories, whereas ignor- 
ing cohort variations may bias the findings on age-related inequalities. In 
light of the intercohort change hypothesis, we further test the intracohort inequal- 
ity hypothesis that sex, race, and SES disparities in health change significantly 
over the life course, independent of cohort differences. We directly tested 
the implicit assumption on which prior studies were based, that is, health 
inequalities by sex, race, and SES change over the life course in the same way 
within each cohort and the intracohort patterns reflect aging-related phe- 
nomena that are universal across all cohorts. 

There are two major theoretical perspectives on social stratification of 
aging and health that predict different directions of change in patterns 
of stratification over the life course. The dominant explanation for increasing 
health disparities with age is the cumulative advantage/disadvantage theory. 
Originating from studies of scientific careers and status attainment mod- 
els (Merton 1968), the cumulative advantage/disadvantage theory has recently 
proliferated in life course studies of changes in the effects of personal and 
structural characteristics on various outcomes. The theory suggests that the 
effects of early advantage or disadvantage accumulate over the life course, 
thereby increasing heterogeneity within cohorts (O'Rand 2003). In the con- 
text of aging and health, it predicts that inequalities in social status and 
consequently in health status early in life amplify with age and differentiate 
individuals further as they age. A secondary explanation for the diverging 
health gaps with age has been the double-jeopardy hypothesis that old age 
and racial/ethnic minority status interact to widen black and white health 
differences, and it has also been extended to a triple-jeopardy hypothesis 
that considers the compounding effects of sexism (Ferraro and Farmer 1996). 
The prevailing explanation for decreasing health disparities with age is the 
age-as-leveler theory. It suggests that the health gaps narrow across the life 
course due to the equalization of resources in later life and the selective sur- 
vival of elite minorities who have acquired immunity against hardships in 
life (House et al. 1994; Preston, Hill, and Drevenstedt 1998). 

Extant empirical studies suggested opposing directions of change in sex, 
race, and SES gaps in health with age and hence inconsistent support for 
the theories mentioned. Regarding the sex and race gaps, some found that 
these gaps widened with age, while others found that these gaps converged 
with age, and the racial gap even exhibited a crossover in old age. Most 
extant cross-sectional or short-term longitudinal analyses of sex and race 
differences in health were concerned with individual-level as opposed to 
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aggregate data, but these analyses provided no consistent support for either 
perspective. Several recent studies that more rigorously examined how sex 
and race effects on health changed over the life course using better statistical 
models of age interactions with sex and race showed no apparent increases 
or decreases in sex or race gaps when other social factors were controlled 
(see, e.g., Yang and Lee 2009 for a review). Findings were also inconsistent 
with regard to whether the effects of SES on health outcomes strengthened 
or diminished over the life course. Education and income gaps in health 
have been found to diverge, converge, remain stable, or diverge from early 
to middle age and converge in old age (see, e.g., Chen, Yang, and Liu 2010 
for a review). One major limitation of these studies is that they did not test 
for cohort effects. Because the theoretical perspectives above all emphasized 
intracohort differentiation, a proper test should control for cohort differences 
that might otherwise be confounded with aging effects. 

While most extant studies concerned specific health outcomes, less is 
known about cumulative measures of health disorders and deficits, indicated 
by frailty, that have recently been shown to hold a particularly strong rela- 
tionship with the aging process (Mitnitski, Song, and Rockwood 2004). And, 
even less is known about population heterogeneity in the age dynamics of 
frailty. So, we supplement the analysis on the three health outcomes with one 
on the Frailty Index (FI) by extending a recent study by Yang and Lee (2010). 

For the HRS data, which include only four cohorts of older adults, we esti- 
mated age growth trajectories for each cohort separately. That is, for each 
cohort, we specified models of age change and included individual-evel 
covariates (sex, race, education, attrition dummies) in the level 2 model and 
within-person time-varying covariates (income, marital status, smoking) in 
the level 1 model. Age was centered around the cohort median and divided 
by 10. An alternative specification was to include cohort membership in the 
level 2 model as Equations (9.2)-(9.4), which we illustrate in Section 9.4 using 
the ACL data, which consist of more birth cohorts and require a parsimoni- 
ous test of cohort differences. 


9.3.2 Results 


The model estimates of fixed effects of age and other covariates are pre- 
sented in Table 9.2 for the total sample and four cohorts. For comparison, 
we show first the model for all based on the total sample that included all 
cohorts that resembles analyses in many previous studies without distin- 
guishing aging and cohort effects. It shows significant sex, race, and SES 
gaps in the age growth trajectories of frailty indicated by both mean and 
age slope (linear growth rate). However, these results do not constitute evi- 
dence for significant social disparities in aging and frailty because cohort 
effects were not controlled. Therefore, we turn to the next set of results from 
cohort-specific models. First, we found quadratic age trajectories of the FI for 
all cohorts, indicating increases in the accumulation of health deficits and 
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disorders with age that can be considered as rates of biological aging. The 
shapes of age trajectories were cohort specific, however. While the AHEAD 
(Study of Assets and Health Dynamics Among the Oldest Old), CODA 
(Children of Depression), and HRS cohorts showed accelerations in the FI 
with age, as indicated by positive quadratic growth rates, the WB (war baby) 
cohort showed a deceleration in the FI with age, as indicated by a negative 
quadratic growth rate. 

Second, we found partial support for the intracohort inequality hypoth- 
esis. Within cohorts, the effects of sex, race, education, and income were all 
highly significant on the intercepts or means of the FI. In the model for the 
AHEAD cohort, being female was associated with a 0.019-unit higher FI on 
average, net of other factors. The effect of race was stronger than that of sex: 
A nonwhite had a 0.025-unit higher FI on average. And, the effect of educa- 
tion was the strongest of all: Having less than 13 years of education increased 
the FI by 0.028. Poverty, similarly, increased one's FI score by 0.014. The strati- 
fication patterns were also observed in each other within-cohort model, sug- 
gesting that females, nonwhites, and low-SES groups suffered from a greater 
degree of physiological reserve loss at any given age than their male, white, 
and higher-SES counterparts, adjusting for all other factors. While consistent 
with the widely documented inverse relationship between social status and 
the risk of illness (Link and Phelan 1995), the racial and SES gaps in mean FI 
levels also provide new evidence that social adversity exerts strong effects 
on multiple domains of deficit accumulation simultaneously and on the com- 
plex process of physiological deregulation. 

The findings on social heterogeneity in rates of biological aging indicated 
by the growth rates of the FI, however, were more varied. When the cohort 
effect was controlled, there was no consistent support for the intracohort 
inequality hypothesis. As opposed to the results from all cohorts combined 
that showed a significant decrease in the sex gap with age (-0.004, p « .001), 
the cohort-specific models showed no significant sex effect on growth rates 
of the FI. Therefore, one would have found support for the age-as-leveler 
hypothesis regarding sex differences in frailty without adjustment for cohort 
differences. Instead, we found that sex disparities in frailty remained con- 
stant over the life course within cohorts. We found evidence for a converg- 
ing racial gap in frailty for one cohort but constant racial gaps for the other 
cohorts. The race gap decreased with age significantly only in the AHEAD 
cohort (—0.009, p < .10), as shown by the predicted age trajectories of the FI by 
race from the model in Figure 92a. 

SES disparities in frailty changed in different directions depending on indi- 
cators used. As predicted by the cumulative advantage theory, the education 
disparities increased over the life course within all cohorts, suggesting posi- 
tive effects of education on growth rates of frailty and diverging frailty trajec- 
tories of the highly educated and those with a high school degree or less. The 
education gaps showed significant increases with age within all cohorts, but 
the magnitudes of such changes varied: The divergence in the educational 
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FIGURE 9.2 
Race and SES gaps in predicted age trajectories of the Frailty Index within cohorts: (a) race gap 
in AHEAD cohort; (b) education gap in CODA cohort; (c) income gap in HRS cohort. 


gap was twice as large in the WB cohort as the AHEAD cohort. Figure 92b 
plots the predicted age trajectories by education level for the CODA cohort 
that experienced an intermediate degree of divergence. On the other hand, 
the income disparities decreased over the life course within two cohorts, as 
predicted by the age-as-leveler theory. For the AHEAD and HRS cohorts, the 
below-poverty group showed significantly slower rates of aging than others 
with higher income, leading to convergences in the trajectories of the FI, as 
shown in Figure 92c for the HRS cohort. The income gaps did not show sig- 
nificant age changes and hence remained constant for the other cohorts. 

In sum, the foregoing HAPC-GCMs analyses of intracohort heterogeneity 
in age trajectories led to the following important conclusion: The life course 
changes in health disparities that were found in previous studies were actually due 
largely to cohort differences. Taking cohort effects into account substantially 
modifies the existing understanding of the relationships between social 
inequalities and aging. The lack of consistent relationships between social 
status and rates of aging across cohorts precludes any straightforward char- 
acterization of social heterogeneity in the dynamics of frailty and aging. We 
further note that there were potential measurement limitations of the study, 
such as the dichotomous racial group categorization, which may obscure an 
even more complex pattern of inequality over the life course and necessitate 
future research. The only clear conclusion that can be drawn at this point 
is that the expression of biological aging and the accumulation of general 
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system damage did not follow the same path under different circumstances 
within a human population. In fact, individuals” slopes of change with age 
were sensitive to social conditions in which they were embedded. This is an 
important finding that challenges a prevailing assumption in many medi- 
cal and genetic studies that there exist common paths and determinants of 
health declines and aging across individuals. It provides another example 
of the powerful interplay between biological and environmental forces that 
shape the divergent developmental paths of aging organisms. 


9.4 Intercohort Variations in Intracohort Heterogeneity Patterns 
9.4.1 Hypothesis 


The intercohort change hypothesis refers to changes in overall cohort means, 
and the intracohort inequality hypothesis refers to whether sex and race groups 
within cohorts become increasingly heterogeneous with age. It is logical to 
ask further how the patterns of within-cohort heterogeneity differ across 
cohorts. Therefore, we next examine intercohort variations in intracohort 
social disparities in health trajectories. 

A large body of sociological research has shown that sex and race inequali- 
ties in health largely result from differences in power, prestige, social sta- 
tus, socially learned lifestyles, behaviors, roles, and stress (Verbrugge 
1989; Williams 2005). The secular trends of these risk factors may not have 
occurred in a parallel manner across male and female cohorts and across 
black and white cohorts. Changes in sex- and race-specific exposures to these 
factors across birth cohorts may thus contribute to intercohort variations in 
intracohort sex and race disparities in health trajectories over the life course. 
Cohort membership therefore provides an important social structural con- 
text that conditions the cumulative advantage/disadvantage or leveling pro- 
cess. If sex and race gaps in major risk factors for health have decreased in 
more recent cohorts, then the corresponding health gaps may diminish. This 
suggests sex and race gaps in mean levels of health narrow in more recent 
cohorts. In addition, sex and race gaps in growth rates of health problems 
may also change across cohorts. The patterns of divergence or convergence 
with age may become stronger or weaker or even change directions in more 
recent cohorts. 

There is evidence that the trends of increasing education effects over the life 
course (or larger education gaps in older ages) have strengthened in more recent 
cohorts in the United States (House, Lantz, and Herd 2005; Lynch 2003). A most 
recent study by Chen, Yang, and Liu (2010) used the methodology introduced 
in the previous and further discussion in this chapter to show a more compli- 
cated picture. The effect of education on mean level of health decreased across 
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six 10-year cohorts born between 1890 and 1970 in China, but the educational 
gap remained constant with age. The income gap in health trajectories diverged 
for earlier cohorts but converged for most recent cohorts. In sum, the intercohort 
difference in intracohort inequality hypothesis states that patterns of change across 
the life course in social disparities in health trajectories vary by cohort. 


9.4.2 Model Specification 


We illustrate the model specifications using the ACL data with a focus on sex 
and race disparities in health, which can easily be extended to include other 
stratification variables, such as SES. Building on the same level 1 model in 
Equation (9.1) and the level 2 model in Equations (9.2)- (9.4), we further assess 
cohort differences in the associations between social status and age trajec- 
tories. That is, at level 2, we specify interaction effects between birth cohort 
and other person-level characteristics such as sex and race for models of each 
growth parameter. 


Level 2 model: 
For the intercept: 


Bo; = Yoo + YoiCohort; =F Yo2Sex; + YosRace; ck Yo Cohort -Sex 


(9.5) 
+ YosCohort - Race + Wo; 
For the linear growth rate: 
Bi; = Yi + YuCohort, + ¥12Sex; + Y13Race; + 'y Cohort - Sex 
(9.6) 


+ Yu Cohort: Race + wy; 


In the model for the intercept (9.5), Yo. and es are the sex and race effect 
coefficients respectively that indicate intracohort differences by sex and race 
in the mean level of health, yo, and Yos are interaction effect coefficients that 
indicate intercohort differences in the intracohort sex and race effects on the mean 
level of health. In the model for the linear growth rate (9.6), y,; and yı; represent 
the intracohort sex and race differences in age change in health, respectively, 
and y; and te are coefficients for three-way interactions between cohort, age, 
and sex/race that indicate intercohort differences in the intracohort sex and race 
effects on age change. A similar model for the quadratic growth rate can be 
specified. Control variables can be entered at level 1 for time-varying covari- 
ates (such as income) and level 2 for time-constant covariates (such as educa- 
tion and attrition dummies). All continuous variables are centered in order 
for the intercept to be substantively meaningful. 
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9.4.3 Results 


There is support for intercohort variations in intracohort sex and race dif- 
ferences in mean health levels but not in growth rates of health problems 
except for depression. Table 9.3 shows significant cohort-by-sex interaction 
effects on the intercepts of CES-D (0.03, p < .05) and disability models (-0.03, 
p < .001), suggesting that sex gaps in depression widened across cohorts, 
whereas the gaps in disability decreased across cohorts. Figure 9.3 shows the 
predicted mean levels of CES-D scores and disability by cohort for men and 
women based on these estimates that adjusted for age and all other factors. 
While the female disadvantage in mental health increased in more recent 
cohorts (Figure 93a), the female disadvantage in disability declined in more 
recent cohorts (Figure 9.3b). 

There were also significant cohort differences in racial gaps in mean health 
outcomes, net of all other factors. The cohort-by-race interaction effects in 
the intercept models of CES-D and self-rated health indicated that race gaps 
widened across cohorts. Figure 94 presents the predicted mean CES-D and 
self-rated health scores by race from the models that show divergences in 
the black and white gaps for more recent cohorts. Black cohorts experienced 
steeper increases in mean CES-D scores in each successive cohort and continu- 
ous decreases in self-rated health. Adjusting for all other factors reduced the 
magnitudes of the cohort effect coefficients in the intercept model for disabil- 
ity and diminished the significance level in the intercept model for self-rated 
health. Therefore, intercohort variations in mean levels of disability and self- 
assessments of health can be largely explained by cohort differences in SES, 
marital status, chronic illness, obesity, and mental health. With self-assessments 
of health, cohort differences in patterns of smoking also played a role. 

The cohort-by-sex and cohort-by-race interaction effects were not signifi- 
cant in the models of growth rates when the cohort, sex, and race effects were 
controlled and thus are omitted. Combining results from the models of inter- 
cept and growth rates, we find that significant intercohort differences existed 
in changes of intracohort sex gaps in depression but not in changes of intra- 
cohort sex or race gaps in disability or self-rated health. Therefore, these gaps 
in growth trajectories were constant within cohorts. This is expected given 
the results from the tests of intracohort inequality hypothesis, which indi- 
cated that a significant intracohort sex gap in growth rates was found only 
in depression, controlling for cohort effects. Figure 9.5 shows the predicted 
sex-specific age trajectories of depression for select cohorts based on the 
final model in Table 9.3. Consistent with previous findings from Yang (2007), 
whereas the gross age effects in Table 9.1 indicate increases in depression 
with age, the net depression trajectories showed decreases with age within 
cohorts after adjusting for the effects of SES, marital status, and physical ill- 
ness. Furthermore, in support of the current hypothesis, the within-cohort 
sex gap did not uniformly increase or decrease across the life course but 
varied by cohort. For instance, the sex gap first diverged in the earliest cohort 
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Growth Curve Model Estimates of Cohort Differences in Sex and Race 


Effects on Age Trajectories of Health 


CES-D Disability Self-Rated Health 

Fixed Effects Coef.? Coef. Coef. 
For Intercept 
Intercept 0.16 1.50*** 3.34*** 
COHORT 0.06* —0.17*** 0.03 
COHORT? 0.02 *** 
SEX (1=female) —0.07 0.19*** 0.02 
RACE (1-black) -0.12 0.06 0.07 
COHORT * SEX 0.03* —0.03* -0.01 
COHORT * RACE 0.07 *** -0.01 -0.04* 
For Linear Growth Rate 
Intercept 0.05 0.73 *** —0.07 
COHORT —0.03** -0.28 *** —0.03* 
COHORT? 0.02 *** 
SEX (1=female) —0.08* 0.04 0.04 
RACE (1=black) -0.01 0.06 -0.05 
For Quadratic Growth Rate 
Intercept 0.09 *** 0.33 *** 0.07 *** 
COHORT —0.06 *** 
Control Variables 
DIED 0.07* 0.35 *** —0.29*** 
NONRESPONSE 0.09** 
EDUC —0.04*** —0.02 *** 0.05*** 
INCOME 0.00* 0.00 + 0.00 *** 
NOTMARRIED 0.24*** 0.04* 0.02 
ILLNESS 0.07*** 0.16*** -0.26*** 
BMI (ref. = normal) 

UWEIGHT 0.22 *** 0.22 *** -0.46*** 

OWEIGHT -0.07 ** —0.02 -0.05* 

OBESE -0.07* 0.05* -0.24*** 
SMOKE 0.00 *** 0.00 -0.01*** 
CES-D -0.17*** 
DISABLE 0.20*** 0.11 *** —0.32*** 
HEALTH —0.17*** —0.18 *** 
Random Effects—Variance Components 
Level-1: Within-person 0.36 *** 0.22*** 0.37*** 
Level-2: In intercept 0.27*** 0.21*** 0.33*** 


continued 
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TABLE 9.3 (continued) 


Growth Curve Model Estimates of Cohort Differences in Sex and Race 
Effects on Age Trajectories of Health 


CES-D Disability Self-Rated Health 
Goodness-of-Fit Coef.? Coef. Coef. 
In growth rate 0.14*** 0.14*** 0.24*** 
BIC 26733.5 21838 27416.5 


Note: + p < 0.10; * p < 0.05; ** p < 0.01; *** p < 0.001 (two-tailed test). 
a tratios are omitted due to space constraints; they are available upon request. 


born before 1905, with men having more depressive symptoms than women, 
then crossed over in the 1915-1924 cohort, converged in the War Babies, and 
became largely constant in the baby boomers. Overall, this showed a trend of 
decreasing degrees of or lessening convergence in female and male depres- 
sion trajectories in more recent cohorts. 

Comparisons of estimates of the random effects in Tables 9.1 and 9.3 sug- 
gest that the inclusion of cohort, sex, and race effects largely reduced the 
level 2 variances in the intercepts for models of all three health outcomes. 
And, adjusting for all covariates in Table 9.3 also decreased the Bayesian 
information criteria (BICs) and hence improved the model fit. The residual 
variances in the growth rates were only slightly reduced, suggesting addi- 
tional factors to consider in future research. 

Returning to the HRS analysis of four cohorts of older adults, we present 
the p values of tests of cohort differences in sex, race, and SES effects and all 
others included in the model in the rightmost column of Table 9.2. They are 
associated with the interaction terms of cohort and these variables. As noted 
in the introduction of datasets in Chapter 3, additional waves of the HRS, 
as compared to the ACL, increased the power of the HAPC-GCMs in tests 
of aging-related effects within each cohort, but the fewer HRS cohorts may 
decrease the power of tests for cohort differences. Nonetheless, the results 
showed significant cohort differences in the overall intercept and growth 
rates, supporting the intercohort change hypothesis. Different from the ACL 
analysis on individual health outcomes, the HRS analysis suggested no sig- 
nificant intercohort variations in intracohort sex and race gaps in frailty. 
However, there were significant cohort differences in the education and 
income effects on the mean, suggesting intercohort variations in intracohort 
SES inequalities in levels of frailty. The positive effect of poverty on frailty, in 
particular, increased in more recent cohorts. No cohort effects were found on 
the social disparities in growth rates. This is consistent with the results from 
the ACL analysis. The two studies jointly suggested that social disparities 
in the aging-related health processes existed in weak forms in the context of 
cohort experiences. This is strong evidence for Norman Ryder's argument 
that cohort is a unique structural category that has explanatory power above 
and beyond other social stratification systems. 
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FIGURE 9.3 
Predicted mean health outcomes by birth cohort: sex gap: (a) depressive symptoms; (b) physical 


disability. (Figure 9.3a adapted from Yang and Lee, 2009: Figure 1A; Figure 9.3b adapted from 
Yang and Lee 2009: Figure 2A.) 
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Predicted mean health outcomes by birth cohort: race gap: (a) depressive symptoms; (b) self- 
rated health. (Figure 9.4a adapted from Yang and Lee, 2009: Figure 1B; Figure 9.4b adapted 
from Yang and Lee, 2009: Figure 3.) 
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FIGURE 9.5 
Predicted age growth trajectories of depressive symptoms by cohort: sex gap. (Adapted from 
Yang and Lee, 2009: Figure 4.) 


9.5 Summary 


We have shown the use of the GLMMs framework in longitudinal studies of 
prospective cohort panel data to address some long-standing questions in the 
stratification of aging. We empirically evaluated the proposition that consid- 
ering the process of cohort change is important for the theory, measurement, 
and analysis of social inequalities in health over the life course. We found 
substantial evidence that supported this proposition based on a systematic 
investigation of the distinct role of aging and cohort in the relationships 
between social status defined by sex, race, and SES and comorbidities of phys- 
ical and mental problems and frailty. We found that changes in health dis- 
parities across the life course were largely cohort-related phenomena. We also 
identified major mechanisms by which cohort differences in various health 
outcomes were realized. The findings prompt more careful and thorough 
examinations of various aging-related hypotheses in a cohort-specific context. 

The first and most foundational component of longitudinal cohort analysis 
using the mixed effects statistical model concerns how cohort effects condi- 
tion overall health trajectories over the life course. If there exist substantial 
intercohort variations in aging trajectories of health, then health and social 
disparities therein are due in part to social historical changes exogenous to 
developmental/age changes in physical and mental states. This highlights 
the necessity of examining cohort effects for a proper attribution of aging 
effects in life course research. 
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The second kind of analysis concerns how cohort effects alter existing 
explanations of social disparities in health trajectories over the life course. 
We did not find evidence for diverging or converging intracohort inequali- 
ties in disability, self-rated health, or frailty net of intercohort differences in 
health trajectories. Therefore, neither the cumulative advantage nor the age- 
as-leveler theory explained the mechanisms generating the life course pat- 
terns of sex and race disparities in these health outcomes because the sex and 
race gaps were persistent and constant across ages within cohorts. On the 
other hand, the diverging education gaps in frailty with age for all cohorts 
of U.S. older adults strongly supported the cumulative advantage theory. 
We have clearly shown that examination of cohort effects in the longitudi- 
nal growth curve analyses provided more conceptual clarity and a better 
empirical test of competing aging theories. The strong presence of interco- 
hort differences as opposed to age differences in sex and race gaps in health 
suggests that we focus on cohort-related explanations. 

The third kind of analysis subsumes the first two and assesses how 
cohorts differ in patterns of intracohort changes in heterogeneity in health 
trajectories. Intercohort differences can exist in the means or the growth 
rates (age change) or both. Persistent and growing social disparities in mean 
levels of health across cohorts may indicate a lack of improvement and pos- 
sible adverse changes in the cohort-related exposures to risk factors for the 
disadvantaged groups. For instance, women born after World War II expe- 
rienced unanticipated increases in social stress associated with balancing 
work and family, marital dissolution, and economic deprivation in female- 
headed households that decreased their perceived sense of health well-being. 
Racial discrimination in terms of residential segregation and economic iso- 
lation also continues to affect blacks' quality of life in ways that may mat- 
ter more for their mental than physical health. Intercohort differences in 
the social gaps in growth rates of health problems mean that the salience of 
social stratification in health not only varies across the life course but also 
varies across cohort. For instance, the patterns of age changes in sex gap in 
depression were strongly modified by and dependent on birth cohort. We 
saw less convergence in the male-female gap in depression with age in more 
recent cohorts, suggesting a weaker age-as-leveler process in these cohorts. 
In the case of no significant changes in the intracohort heterogeneity in the 
age trajectories but substantial intercohort variations in social gaps in mean 
levels of health (as for disability and self-rated health), we conclude that any 
changes with age observed in previous studies or studies that ignored cohort 
effects were actually due to intercohort differences in health. That is, there 
was no stronger cumulative advantage process or age-as-eveler effect at 
work in later or earlier cohorts when the confounding effects of aging and 
cohort were distinguished and other social factors controlled. 

In conclusion, an assessment of the distinct roles of aging and cohort 
effects in the study of social inequalities in health may provide a better 
understanding of biological and social structural factors that condition 
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health inequalities over the life course and help researchers better evalu- 
ate the explanatory power of competing theories. Cohort membership 
contextualizes individuals' health change with age and conditions social 
inequalities in health. Cohort effects also seem to be essential for moder- 
ating sex inequality in age trajectories for certain health indicators such as 
depression. The significance of cohort change in shaping social inequali- 
ties in health over the life course implies the relevance of both biological 
forces and historical context, and it suggests emerging new patterns for 
future cohorts entering adulthood and old age as a result of their new life 
circumstances. The identification of cohort-specific patterns of inequali- 
ties in psychiatric morbidity and physical functioning thus has important 
implications for public health and finance as the largest cohort in U.S. his- 
tory—the baby boomers—reaches the 65+ ages in the 18 years after 2010. 
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10 


Directions for Future Research 
and Conclusion 


10.1 Introduction 


Rates of disease, vital status, and social measurements arrayed over time 
by age or cohorts are canonical organizations of data in demographic, epi- 
demiologic, and social research. A vast literature over the past four decades 
has used conventional linear accounting/multiple classification models 
and statistical methodology for age-period-cohort (APC) analysis. In this 
context, different solutions to the model identification problem often pro- 
duced ambiguous and inconsistent results and ignited continuous, never- 
ending debates on whether any solutions exist or which solutions are better. 
Controversies that have appeared in the literature have been exacerbated by 
the inadequacy of existing APC models and methods. We have noted that the 
APC identification problem is an instance of a more general class of struc- 
tural underidentification problems in the social sciences. Thus, advances in 
models and methods for APC analysis may also facilitate innovations on 
other instances of structural underidentification. 

The developments introduced in previous chapters highlight new approaches 
to this problem—or simply new ways of thinking about cohort analysis. We 
have reached the insight that the identification problem is model specific 
rather than data specific (Fu 2008). The early work of Mason and colleagues 
(1973: 246) actually made a similar observation on the origin of the model 
identification problem: "[The estimation of the accounting model] is problem- 
atic because the relationships between age, period and cohort have the same 
functional form as the expected relationship of each of these independent 
variables to the dependent variable, Y. That is, we cannot estimate unique 
effects for age, period and cohort because we postulate each variable to be linearly 
related to Y and at the same time assume that A, P and C are linearly related to each 
other" (italics added). Because the use of conventional linear accounting mod- 
els (and hence the identification problem) characterizes the majority of previ- 
ous studies, investigators erroneously concluded that the problem is inherent 
in APC analysis. The exclusive focus on the tabular rates data design in the 
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early literature contributed to this confusion because the A, P, and C vari- 
ables in this setting are indeed exactly linear. Often, however, it is the linear 
accounting models, not the APC data structure, that incur the identification 
problem. The quotation from Mason et al. (1973) implied that there would not 
be such a problem if either or both of the following conditions are met: (1) A, 
P, and C are not linearly related to each other; and (2) it is not postulated that 
each variable be linearly related to Y. A deficiency in previous studies on this 
topic is that they are almost exclusively focused on finding solutions to break 
condition 1 while paying no attention to condition 2. In this book, we have 
provided a more comprehensive set of approaches and tools that deal with 
both. That is, we have shown the use of multilevel data to satisfy the first con- 
dition, developed alternative modeling approaches via the hierarchical APC 
(HAPC) models to satisfy the second condition, and have also shown how 
the combination of both approaches can be the most effective way of account- 
ing for aging-related phenomena and social and demographic change. 

Beyond the identification problem, we have also shown how applications 
of different classes of models in the generalized linear mixed models frame- 
work to the same or similar datasets yield highly consistent models and 
substantive inferences across all three research designs: tables of rates or 
proportions, repeated cross-sectional surveys, and accelerated longitudinal 
panel designs. In addition to model validation through simulations, we 
have provided empirical evidence that validated findings from applications 
of the Intrinsic Estimator (IE) for the conventional APC linear accounting 
model. Specifically, as shown by our comparisons of the estimated age, 
period, and cohort temporal trends for the General Social Survey (GSS) ver- 
bal ability and Surveillance, Epidemiology, and End Results (SEER) Program 
data, the IE and HAPC analyses lead to similar conclusions. The HAPC 
models further allow for the test of contextual characteristics and additional 
explanatory hypotheses. The growth curve analysis of prospective cohort 
data supplements the former two approaches and enables analysts to draw 
inferences from true birth cohorts. Therefore, these members of the general- 
ized linear mixed model (GLMM) family are complementary to each other 
and jointly form a comprehensive set of tools that can extend the reach of 
cohort analysis to a wide variety of problems not before possible. 

Our joining together of these three classes of models for APC analysis 
within the larger family of GLMMs opens up the possibility for further sta- 
tistical methodological developments. For instance, there have been recent 
and continuing developments in statistical methodology for GLMMs and 
related software programs toward more flexible specifications on functional 
forms (e.g, spline functions or other non- or semiparametric functional 
forms) for temporal dependence and nonnormal specifications on the sta- 
tistical distributions of the random effects. Given the GLMM framework 
for APC analysis that we have exposited in this book, it can be anticipated 
that these methodological developments will be applied to HAPC models 
when they are motivated by new empirical applications or by the desire to 
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assess robustness of empirical findings by the estimation of such models 
with alternative specifications. New model specifications for the analysis of 
hidden heterogeneity (e.g., finite mixture models) within APC data also may 
be developed. These are just a few examples of the many possibilities for 
methodological refinements that lie ahead. 

While we have featured GLMMs for APC analysis in this book, we do 
not claim exclusivity of GLMMs to unify and organize new developments 
of APC models. There are other recent methodological developments using 
different approaches. We give examples in this chapter of additional mod- 
els and data designs that provide promising avenues for future develop- 
ment and substantive research. 


10.2 Additional Models 
10.2.1 The Smoothing Cohort Model and Nonparametric Methods 


We described in Chapter 3 the first research design using the contingency 
age-by-period tables of rates. In this most frequently used design in APC 
analysis, birth cohorts are defined by the diagonals of the rectangular array. 
When age and period are measured in intervals that are longer than 1 year, 
the adjacent cohorts overlap. For example, in Table 3.1, which presents the 
lung cancer incidence rate data for 5-year age groups across 5-year periods, 
the diagonals represent 9-year birth cohorts that overlap for 4 years: the 1915 
cohort covering birth years 1911-1919 overlaps with the 1920 cohort cover- 
ing birth years 1916—1924 for the years between 1916 and 1919. This has been 
largely ignored by the conventional linear models, which treat cohorts as 
mutually exclusive categories and include cohort effects as additive to fixed 
age and period effects. The observation of overlapping birth cohorts unique 
to this data design, however, is the motivation for a different kind of model. 

Fu (2008) proposed a smoothing cohort model that utilizes the fact that the 
overlap of adjacent cohorts requires that each cohort effect be estimated with 
contributions from near neighbors. This model replaces the fixed cohort 
effects in the APC accounting model [Equation (4.1) of Chapter 4] with 
smoothed cohort effects through a nonparametric spline smoothing func- 
tion. Since the temporal variations in this model apply to the cohort effects 
and leave the age and period as fixed effects, it is a semiparametric model 
and a member of the family of partially linear models, which are an exten- 
sion of GLMMs. Interestingly, the smoothing on the cohort effects introduces 
mild binding to the parameters; breaks up the linear dependency between 
the age, period and cohort effects; and effectively avoids the identification 
problem. The mild binding by the nonparametric smoothing through spline 
smoothing on the cohort effects is just large enough to break the linear 
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dependency and small enough not to introduce much bias, which ensures 
consistent estimation of age and period effects. Fu then introduced a two- 
stage smoothing model. The first stage is to obtain consistent estimates of the 
age and period effects from the smoothing cohort model. The second stage 
is to apply the consistent estimates to the fixed effect of the APC accounting 
model to achieve consistent estimates for age, period, and cohort estimates. 
Fu further demonstrated through simulation studies that this two-stage 
model yielded consistent estimation and sensible results for analyses of 
crime rates and lung cancer mortality rates. 


10.2.2 The Continuously Evolving Cohort Effects Model 


It is well known that the conventional APC linear accounting model suf- 
fers from the identification problem methodologically. Less well appre- 
ciated is that the model also suffers from a conceptual problem that 
becomes obvious if we revisit the model of Equation (4.1) in Chapter 4. It 
can be seen that the model rests on key assumptions that do not always 
accurately describe most APC-related phenomena. It is a model of addi- 
tive effects. It assumes that the effect of age a; is the same for periods j 
and cohorts k. But, the influence of age may change over time and across 
cohorts. Consider, for instance, the dramatic declines in infant mortality 
over the past century. It also assumes that the effect of period p; is the 
same for people of all ages. However, period effects are often age spe- 
cific. For example, the influenza epidemic of 1918 caused especially high 
mortality among people in their teens and 20s. Similarly, it assumes that 
the effect of cohort k is the same as long as the cohort lives. But cohorts 
must change, as Norman Ryder (1965: 861) explained in his seminal article 
about the nature of the cohort process: 


The case for the cohort as a temporal unit in the analysis of social change 
rests on a set of primitive notions: persons of age a in time t are those 
who were age a - 1 in time t - 1; transformations of the social world 
modify people of different ages in different ways; the effects of these 
transformations are persistent. 


That is, as opposed to experiencing fixed impacts of events, cohorts are 
continuously exposed to events whose influences accumulate over the life 
course. Wars and epidemics are examples of such events that may break 
out in the middle of a cohort's life and leave an imprint on all of its subse- 
quent behaviors and outcomes. New events constantly occur. A model with 
unchanging cohort effects is appropriate only if all relevant events occur 
before the initial observation and only if these events' impacts stay fixed as 
the cohort ages (Hobcraft, Menken, and Preston 1982). To capture the pro- 
cess by which cohort effects are generated as described by Ryder, however, 
one needs a more general model—a framework that Hobcraft, Menken, and 
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Preston (1982) labeled “continuously accumulating cohort effects” but lacked 
procedures for investigation at the time of their review. 

Schulhofer-Wohl and Yang (2011) recently addressed this gap. They devel- 
oped a general model that relaxes the assumption of the conventional addi- 
tive model. The new model allows age profiles to change over time and 
period effects to differ for people of different ages. The model also defines 
cohort effects as an accumulation of age-by-period interactions over all 
events across the life course. Although a long-standing literature on theories 
of social change conceptualizes cohort effects in exactly this way, this is the 
first time that a method of statistically modeling for this more complex form 
of cohort effects has been presented. Applying this new model to analyze 
changes in age-specific mortality rates in Sweden over the past 150 years, 
the authors found that the model fit the data dramatically better than the 
additive model. The analyses also yielded interesting results that showed 
the utility of this model in testing competing theories about the evolution 
of human mortality. The flexibility of the model, however, comes at a high 
computational cost because it involves the estimation of a large number of 
parameters. The inclusion of additional covariates also has not been ana- 
lytically attempted. This new approach thus presents both opportunities and 
challenges for future analysts. When further improved, it may find applica- 
tions in many areas of research. 


10.3 Longitudinal Cohort Analysis of Balanced 
Cohort Designs of Age Trajectories 


The multicohort, multiwave data design is especially important for aging and 
cohort analysis. Although several longitudinal surveys using this design are 
available, such as the Health and Retirement Survey and the National Long 
Term Care Survey, cohort studies would benefit from further developments 
in data collection. 

The usual accelerated longitudinal cohort design has two limitations: 
(1) Some ages cannot be observed for all cohorts, and (2) coverage of the indi- 
vidual life course and historical time is extremely restricted. The imbalance 
of the age-by-cohort structure, as illustrated by Figure 3.3 and Table 3.10 of 
Chapter 3, arises from the fact that the baseline survey consists of cohorts of 
different ages and follow-up surveys occurred at the same times. As a result, 
cohorts aged for exactly the same number of years but will remain age het- 
erogeneous at the end of data collection. The growth curve models applied to 
these data therefore yield estimates of cohort differences in age trajectories 
based only on the overlapping age groups of adjacent cohorts rather than the 
entire possible range of ages. This could affect the accuracy of the estimates 
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when the number of waves is small or the overlapping age intervals are few. 
Increasing the number of follow-ups alleviates problems for inference but is 
less than perfect for the purposes of disentangling aging effects from birth 
cohort differences and observing period effects. 

A better design is one in which the age-by-cohort data structure is bal- 
anced and extends for a long time. Extant secondary data that meet these 
criteria are exceedingly rare. But, with the help of modern computing tech- 
nologies that aid the compilation of historical records, longitudinal data that 
cover a more extended segment of the life course are emerging and worthy 
of attention. The Liaoning Multigenerational Panel (LMGP) data (Campbell 
and Lee 2009), for example, may be an exceptional resource for cohort stud- 
ies. The LMGP is a database of at least 1 million observations of 200,000 
individuals from eighteenth to early twentieth century Chinese population 
registers. It provides entire life histories for men and nearly complete life his- 
tories for women. The length of the historical period spanned and the sheer 
number of observations included make it a great candidate for future lon- 
gitudinal research on aging. Another example is the Peking Union Medical 
College Hospital (PUMCH) birth records data on over 2,000 individuals born 
in 1921-1954 who were followed up during 2003-2005 with clinical exami- 
nations and interviews (Zeng et al. 2010). In this case, all individuals had 
detailed obstetric records at birth, so the data on health changes within each 
cohort cover large age spans from birth to middle or old ages. Although 
the data are unbalanced to the right at the same follow-up year, they are 
balanced to the left and include 50 years of ages shared by all cohorts. If 
additional follow-ups were to be conducted, the studies will become even a 
greater source for studying age trajectories of health. 

Data like these are difficult to collect and compile but can be extremely 
useful for a variety of substantive investigations. For example, the mecha- 
nisms underlying persistent cohort differences in mortality need to be better 
understood. The “cohort morbidity phenotype” hypothesis has been pro- 
posed to link large cohort improvement in survival to reductions in expo- 
sures to infections, inflammation, and increased nutrition in early life (Finch 
and Crimmins 2004). But, evidence of an association between early-life con- 
ditions and late-life mortality is solely based on aggregate population data 
from developed countries and needs further testing using individual life 
histories across multiple birth cohorts in other national populations, such as 
those in the LMGP and PUMCH. A related question that can benefit from a 
better cohort design is the debate about whether chronic disease at adult ages 
is determined by “programming” in utero or childhood or whether early- 
life factors are mediated through the lifelong accumulation of risk factors. 
The fetal origin hypothesis, which supports the former scenario, has been 
widely used to explain the mechanisms underlying adult diseases (Barker 
1998). One challenging issue is that it is extremely difficult to sort out the 
confounding effects of life course mediators without data that connect child- 
hood with old ages. It is necessary to collect information on biological and 
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social factors from cohorts in infancy to older adulthoods. Birth record data 
with long-term follow-up designs hold the potential for vastly enhancing our 
ability to estimate various age and cohort models to shed light on this debate. 


10.4 Conclusion 


The APC problem has intrigued and frustrated scientists for decades. Since 
the last synthesis of statistical models and methods for APC analysis in the 
social sciences by Mason and Fienberg (1985), there have been substantial 
developments in statistical theory, methods, and computational algorithms. 
One example, on which we have based the models and methods described in 
the previous chapters, is the generalization of the class of generalized linear 
models to the class of GLMMs. We think these innovations in statistics are 
sufficiently important that they support, indeed almost require, rethinking 
the problem of APC analysis. 

In its simplest terms, this is what we seek to accomplish. This book sum- 
marizes the developments of new models and methods for APC analysis that 
we and our collaborators have initiated. The technique of the IE for modeling 
APC tabular data helps one to obtain statistically sound solutions to the iden- 
tification problem that has long compromised previous APC analyses when 
linear models or generalized linear models are applied to aggregate tabu- 
lar data or contingency tables. And, the techniques of the HAPC-CCREM 
(cross-classified random effects model) and HAPC-GCM (growth curve 
model) analysis help one to avoid the identification problem and address 
additional theoretical questions using the GLMM and multilevel microdata 
on synthetic or true cohorts. We have successfully applied these models and 
methods to the empirical analysis of data in sociology, demography, and epi- 
demiology that generated important substantive findings regarding histori- 
cal and future trends in cancer morbidity and mortality and social disparities 
in aging, frailty, and overall well-being. These techniques, coupled with new 
and superior data sources, set the stage for a new era of cohort analysis that 
will enhance our understanding of the complex interplay of human aging, 
social epidemiologic conditions, and historical events and processes. 

We also think that the statistical models and methods described in the 
previous chapters and the associated software will be sufficiently accessible 
to many demographers, epidemiologists, and social scientists that they can 
be widely applied to APC data in the form of tables of rates or proportions, 
repeated cross-sectional sample surveys, and accelerated longitudinal panel 
studies. Indeed, we see evidence of this already in the empirical studies that 
recently have been published or soon will appear. These studies should pro- 
duce new empirical knowledge of many phenomena or processes in demog- 
raphy, epidemiology, and the social sciences. 
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Our expectation is that, over the years to come, this will result in the accu- 
mulation of knowledge of classes of phenomena that are primarily cohort 
driven (such as the GSS verbal test score data analyzed in Chapters 5-8) 
versus those that are primarily period driven (such as the National Election 
Survey voter turnout data analyzed by Frenk, Yang, and Land in 2012) ver- 
sus those that are affected by elements of both temporal dimensions (such 
as the cancer incidence and mortality analyses of Chapters 5-8), as well as, 
perhaps, the identification of some phenomena that are dominated by age 
effects and relatively immune to cohort or period effects. As this knowl- 
edge accumulates, it should lead to the development of concepts and theo- 
ries to explain these findings. And, in addition, as the main primary direct 
effects of the age, period, and cohort temporal dimensions are identified and 
classified, researchers may then focus on more subtle interactions between, 
say, social structural variables and the temporal dimensions. This, again, 
will stimulate theoretical developments. Specifically, this accumulation of 
empirical knowledge should, in turn, lead to new conceptual-theoretical 
innovations about how demographic, epidemiological, social, economic, 
political, and cultural processes develop and evolve over time along the age, 
period, and cohort dimensions of time, which can, itself, lead to additional 
empirical studies. 
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